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The conditions under which transactors can use the market 
(repeat-purchase) mechanism of contract enforcement are exam- 
ined. Increased price is shown to be a means of assuring contractual 
performance. A necessary and sufficient condition for performance 
is the existence of price sufficiently above salvageable production 
costs so that the nonperforming firm loses a discounted stream of 
rents on future sales which is greater than the wealth increase from 
nonperformance. This will generally imply a market price greater 
than the perfectly competitive price and rationalize investments in 
firm-specific assets. Advertising investments thereby become a pos- 
itive indicator of likely performance. 


I. Introduction 


An implicit assumption of the economic paradigm of market ex- 
change is the presence of a government to define property rights and 
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comments. Jonathan Skinner also provided valuable research assistance. ‘The Founda- 
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Center for Research in Government Policy and Business, University of Rochester, 
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enforce contracts. An important element of the legal-philosophical 
tradition upon which the economic model is built is that without some 
third-party enforcer to sanction stealing and reneging, market ex- 
change would be impossible.' But economists also have long con- 
sidered “reputations” and brand names to be private devices which 
provide incentives that assure contract performance in the absence of 
any third-party enforcer (Hayek 1948, p. 97; Marshall 1949, vol. 4, p. 
xi). This private-contract enforcement mechanism relies upon the 
value to the firm of repeat sales to satisfied customers as a means of 
preventing nonperformance. However, it is possible that economic 
agents with well-known brand names and reputations for honoring 
contracts may find it wealth maximizing to break such potentially 
long-term exchange relationships and obtain a temporary increase in 
profit. In particular, the determinants of the efficacy of this market 
method of contract performance and therefore the conditions under 
which we are likely to observe its use remain unspecified. 

This paper examines the nongovernmental repeat-purchase 
contract-enforcement mechanism. To isolate this force, we assume 
throughout our analysis that contracts are not enforceable by the 
government or any other third party. Transactors are assumed to rely 
solely on the threat of termination of the business relationship for 
enforcement of contractual promises This assumption is most 
realistic for contractual terms concerning difficult-to-measure prod- 
uct characteristics such as the “taste” of a hamburger. However, even 
when the aspects of a contract are less complicated and subjective and 
therefore performance more easily measurable by a third party such 
as a judge, specification, litigation, and other contract-enforcement 
costs may be substantial. Therefore, explicit guarantees to replace or 
repair defective goods (warranties) are not costless ways to assure 
contract performance. Market arrangements such as the value of lost 
repeat purchases which motivate transactors to honor their promises 
may be the cheapest method of guaranteeing the guarantee. 

While our approach is general in the sense that the value of future 
exchanges can motivate fulfillment of all types of contractual prom- 


' Hobbes ([1651] 1955, pp- 89-90) maintains that “. . . he that performeth first, has 
no assurance the other will perform after; because the bonds of words are too weak to 
bridle men’s ambition, avarice, anger, and other Passions, without the fear of some 
coercive Power; which in the condition of here Nature, where all men are equal, and 
judges of the justness of their own fears cannot possibly be supposed.” 

2 This assumption is consistent with the pioneering work of Macaulay (1963), where 
reliance on formal contracts and the threat of explicit legal sanctions was found to be an 
extremely rare element of interfirm relationships. Macaulay provides some sketchy 
evidence that business firms prevent nonfulfillment of contracts by the use of effective 
nonlegal sanctions consisting primarily of the loss of future business. This “relational” 


nature of contracts has been recently emphasized by Macneil (1974), and also by 
Goldberg (1976) and Williamson (1979). 
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ises, we focus in this paper on contracts between producers and 
consumers regarding product quality. In order for a repeat-sale en- 
forcement mechanism to operate, we assume that the identity of 
firms is known by consumers* and that the government enforces 
property rights to the extent that consumers voluntarily choose whom 
to deal with and must pay for the goods they receive.t In addition, 
managers of firms are assumed to be wealth maximizing and to place 
no value on honesty per se. 

In Section II, the conditions are outlined under which firms will 
either honor their commitments to supply a high level of quality or 
choose to supply a quality lower than promised. In order to em- 
phasize the ability of markets to guarantee quality in the absence of 
any government enforcement mechanism, a simple model is pre- 
sented which assumes that consumers costlessly communicate among 
one another. Therefore, if a firm cheats and supplies to any indi- 
vidual a quality of product less than contracted for, all consumers in 
the market learn this and all future sales are lost. A major result of 
our analysis is that even such perfect interconsumer communication 
conditions are not sufficient to assure high quality supply. Cheating 
will be prevented and high quality products will be supplied only if 
firms are earning a continual stream of rental income that will be lost 
if low quality output is deceptively produced. The present discounted 
value of this rental stream must be greater than the one-time wealth 
increase obtained from low quality production. 

This condition for the “notorious firm” repeat-purchase mecha- 
nism to assure high quality supply is not generally fulfilled by the 
usual free-entry, perfectly competitive equilibrium conditions of price 
equal to marginal and average cost. It becomes necessary to distin- 
guish between production costs that are “sunk” firm-specific assets 
and those production costs that are salvageable (i.e., recoverable) in 
uses outside the firm. Our analysis implies that firms will not cheat on 


3 Nonidentification of firm output leads to quality depreciation via a standard exter- 
nality argument; i.e., supply by a particular firm of lower than anticipated quality 
imposes a cost through the loss of future sales not solely on that firm but on all firms in 
the industry (see Akerlof 1970; Klein 1974). 

4 For simplicity, we assume that “theft,” as opposed to nonfulfillment of contract, is 
not possible. While “fraud,” in the sense of one party to the transaction intentionally 
supplying less than ‘contracted for, is analytically similar to “theft,” we draw a distinction 
along this continuum by assuming that the government only permits “voluntary” 
transactions in the sense that transactors choose whom to trade with. Therefore, while 
consumers cannot “steal” goods, they can, in principle, pay for the goods they receive 
with checks that bounce; and while firms cannot rob consumers, they can, in principle, 
supply goods of lower than promised quality. Although we recognize the great 
difficulty in practice of separating the underlying government enforcement mecha- 
nisms, e.g., property law, from the private promise-enforcing mechanisms we are | 
attempting to analyze, this distinction between theft and fraud is analytically unam- 
biguous. 
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promises to sell high quality output only if price is sufficiently above 
salvageable production costs. While the perfectly competitive price 
may imply such a margin above salvageable costs, this will not neces- 
sarily be the case. The fundamental theoretical result of this paper is 
that market prices above the competitive price and the presence of 
nonsalvageable capital are means of enforcing quality promises.° 

In Section III our theoretical model of quality-guaranteeing price 
premiums above salvageable costs is extended to examine how the 
capital value of these price-premium payments can be dissipated in a 
free-entry equilibrium. The quality-guaranteeing nature of nonsal- 
vageable, firm-specific capital investments is developed. Alternative 
techniques of minimizing the cost to consumers of obtaining an as- 
sured high quality are investigated. We also explore market responses 
to consumer uncertainty about quality-assuring premium levels. Ad- 
vertising and other production and distribution investments in “con- 
spicuous” assets are examined as competitive responses to simultane- 
ous quality and production-cost uncertainties. Finally, a summary of 


the analysis and some concluding remarks are presented in Section 
IV. 


II. Price Premiums and Quality Assurance 


Assume initially that consumers costlessly know all market prices and 
production technologies but not the qualities of goods offered for 
sale. For simplicity, the good being considered is assumed to be 
characterized by a single objective quality measure, q, where quality 
refers to the level of some desirable characteristic contained in the 
good. Examples are the quietness of appliance motors, the wrinkle- 
free or colorfast properties of clothing, or the gasoline mileage of an 
automobile. We also assume that the economy consists of consumers 
who consider buying a product x each period, where the length of a 
period is defined by the life (repurchase period) of product x, and 
who are assumed to costlessly communicate quality information 
among one another. Therefore, if a particular firm supplies less- 


° The notion that an increased price can serve as a means of assuring high quality 
supply by giving the firm a rental stream that will be lost if future sales are not made is 
not new. Adam Smith ([1776] 1937, p. 105) suggested this force more than 200 years 
ago when he noted that “the wages of labour vary according to the small or great trust 
which must be reposed in the workman. The wages of goldsmiths and jewellers are 
everywhere superior to those of many other workmen, not only of equal, but of much 
superior ingenuity; on account of the precious metals with which they are intrusted. We 
trust our health to the physician; our fortune and sometimes our life and reputation to 
the lawyer and attorney. Such confidence could not safely be reposed in people of a 
very mean or low condition.” Similar competitive mechanisms recently have been 
analyzed by Becker and Stigler (1974) and Klein (1974). 
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than-contracted-for quality to one consumer, the next period all con- 
sumers are assumed to know. In addition, this information is assumed 
not to depreciate over time.® 

Identical technology is assumed to be available to all entrepreneurs. 
Hence, there are many potential firms with identical total cost func- 
tions, C = c(x,g) + F(q), where F is fixed (invariant to rate) costs. 
Higher quality and larger quantities require higher production costs, 
F, > 0,¢, > 0, ¢, > 0, and marginal cost is assumed to increase with 
quality, ¢rq > 0. Fixed costs are assumed initially to be expenditures 
made explicitly each period rather than capital costs allocated to the 
current period. For example, they may include a payment on a 
short-term (one-period) rental agreement for a machine but not the 
current forgone interest on a purchased machine or the current 
period’s payment on a long-term rental agreement—both of which 
imply long-term and hence capital commitments. 

We therefore are explicitly distinguishing between “fixed” costs in 
the sense employed here of constant (invariant to output) current 
costs and “sunk” (nonsalvageable) capital costs. The usual textbook 
proposition that a firm will not shut down production as long as price 
is greater than average variable cost blurs this distinction and im- 
plicitly assumes that all fixed costs are also sunk capital costs. Our 
assumption of the complete absence of any long-term commitments is 
analytically equivalent to perfect salvageability of all capital assets. If 
all long-term production-factor commitments were costlessly revers- 
ible, that is, all real and financial assets such as the machine or the 
long-term machine rental contract could be costlessly resold and 
hence perfectly salvageable, there also would not be any capital costs. 
Only the nonsalvageable part of any long-term commitment should 
be considered a current sunk capital cost. 

If buyers are costlessly informed about quality, the competitive 
price schedule, P,., for alternative quality levels is given by the 
minimum average production costs for each level of quality and is 
designated by P, = P..(q). This is represented in figure 1 for two 
alternative quality levels, g, and qmin; by the prices P, and Py. Suppose, 
however, that the quality of product x cannot be determined costlessly 
before purchase. For simplicity, assume prepurchase inspection re- 
veals only whether quality is below some minimum level, gmin, and that 


6 If we modify the assumptions of our model to make interconsumer communication 
less than perfect and allow inflows of new ignorant consumers over time and per- 
mit individuals to forget, the potential short-run cheating gain by firms would be 
increased, Therefore, the quality-assuring price premium would be higher than we 
derive below. In this case increased firm size, by making it more likely that the 
individuals one is sharing product-quality information with (e.g., family and friends) 
have purchased from the same firm, lowers the potential short-run cheating gain by 


essentially reducing the repurchase period. 
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Fic 1.—Pricing and production of alternative quality levels 


the costs are prohibitive of determining whether quality is above Gein 
prior to purchase.’ Obviously, whenever the market price that con- 
sumers will pay for asserted high quality exceeds the cost of produc- 
ing minimum quality output, the firm can increase its initial period 
profits by producing the minimum quality output and deceptively 
selling it as a higher quality product. 

If producers are to have an incentive to produce high quality 
products (in the absence of governmentally enforceable contracts), 
consumers must somehow reward high quality production and 
punish low quality production. We assume in this competitive 
framework that consumers will purchase from particular sellers 
randomly chosen from the group of homogeneous sellers over which 
consumer information is transmitted. If a consumer receives a prod- 
uct of a quality at least as high as implicitly contracted for, he will 
continue to purchase randomly from this group of sellers. On the 
other hand, if quality is less than contracted for, all consumers cease 
to purchase from the particular sampled “cheating” firm. 


7 The quality of the good beyond the minimum level is therefore what Nelson (1970) 
has labeled as an “experience” characteristic. Making the minimum quality level en- 
dogenous does not substantially change the following analysis. 
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Consider now an initial “competitive” equilibrium in which a single 
firm contemplates selling a quality below that expected by customers. 
Given the competitive market price for some high quality, P.(qn) = Pi 
in figure 1, this particular firm could increase its initial period quasi 
rents by producing minimum quality and selling it at the high quality 
price. However, since buyers are assumed to communicate fully with 
one another, all future customers of high quality output, that is, sales 
at prices greater than P,(qmin) = Po in figure 1, are lost. That is, a firm 
that cheats will become known as a “notorious” cheater, and consum- 
ers will not purchase from the firm any product the quality of which 
cannot be determined prepurchase.® 

Whether sales of high or minimum quality will maximize the firm’s 
wealth depends on whether the capital value of future quasi rents 
from continued high quality production exceeds the differential ini- 
tial period quasi rents from quality depreciation. In terms of figure 1, 
at the perfectly competitive price for high quality output, P,, price is 
equal to the average costs of high quality production. Therefore, the 
quasi rents from continued high quality production are zero. i 
alternatively, the firm were to deceptively produce minimum quality 
output, as a price taker it would expand its production to x; (where P, 
= MC,,,,) and receive a one-period quasi rent, the present value of 
which is equal to: 


] 


COE Tp 


£3 
(Py = Pots— J IMCogg(t) — Pals} (1) 
Therefore, at the perfectly competitive price for any quality above 
min firms will always cheat consumers and supply qmin. 

Faced with this possibility, consumers would recognize that re- 
gardless of producers’ promises they will not obtain the higher quality 
product. Therefore, consumers would be willing to pay only the 
costless information price of the minimum quality product whose 
quality they can verify prepurchase, Py. Because of such rational 
consumer anticipations, firms will not be able to cheat, but desired 
high quality output will not be supplied. 

There may, however, be a price higher than the perfectly compe- 
titive price of high quality output, P, that if it were the equilibrium 
market price would (a) motivate honest production of the high quality 
good and (b) not completely dissipate the consumers’ surplus from 
purchase of higher quality. Consider a price such as P2 in figure 1. A 


8 A terminated firm cannot begin business in this industry under a new name. 
However, the highest valued alternative use of the entrepreneurial skills is included in 
salvageable fixed production costs. The firm considered here is assumed to face the 
same opportunities elsewhere as the firms that are honest in production of x. There- 
fore, the cheating firm can elect to enter a new industry. 
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firm supplying high quality output will now expand its production to 
x. The price premium P, defined as the increase in the price above 
minimum average cost of high quality, provides firms supplying high 
quality with a perpetual stream of quasi rents the present value of 
which (assuming unchanging cost and demand conditions over time) 
is equal to: 


ve * [Pr 4 J tater u P de (2) 


The price premium also increases the gains to a firm from supply- 
ing minimum quality at the high price. A firm that chooses to cheat 
will now expand its output (in terms of fig. 1 to x4) and earn the extra 
premium on all units sold.? Therefore, the capital value of the quasi 
rents from supplying quality less than promised is: 


W = TL —[IP + (Pr — Palle — J “IMCapig(®) ~ Peles}. 


Li 
A firm will honor its implicit quality contract as long as the differ- 
ence between the capital values of the noncheating and cheating 
strategies, W, — Ws, is positive. Consider the quasi-rent flow of the 
cheating and noncheating alternatives, that is, the terms in braces in 
our expressions (2) and (3). Define QR, equal to rW, and QR; equal to 
(1 + r)W;. A firm will then elect not to cheat if and only if: 


ieee Os 
pe teo 


Therefore, there will be a price premium that motivates firms to 
honestly produce high quality as long as: 


lim a < — (6) 


Using L’Hospital’s rule, equation (5) will be satisfied as long as 


1G. 3) 
i Xo 


(6) 


for all P > P;, where Ps is some finite price. Intuitively, as the price 
increases it is only the increase in quasi rents on the additional units of 
minimum quality output that favors the deceptive strategy. Equation 
(6) insures that price increases beyond some level increase W, more 
than W; such that eventually W, is greater than Ws. 


* Note that although %, may be greater or less than x3, depending on the price 
premium chosen, given upward-sloping supply functions and the condition that 
MCa,(q) > MCanin(Q) for all q, it must be the case that x, > x, > xy. 
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The condition specified in equation (6) is quite reasonable. It will be 
satisfied as long as a cheating firm does not accompany cheating with 
very large output increases. If, for example, the real interest rate 
were .05 we require only that the output increase by a cheating firm 
not be more than 20 times the total output that would be produced if 
the firm were not cheating. Hence, under very general cost conditions 
a price premium will exist that motivates competitive firms to honor 
high quality promises because the value of satisfied customers exceeds 
the cost savings of cheating them.’° 

While we cannot state broad necessary conditions for the form the 
cost function must take to imply the existence of a quality-assuring 
price, “reasonable” sufficient conditions can be stated. In particular, 
all cases of vertically parallel marginal cost curves, as illustrated in 
figure 1, where quality is produced by a fixed input not subject to 
decreasing returns to scale (such as the use of a better motor) and 
where the second derivative of marginal cost is greater than or equal 
to zero imply the existence of a quality-assuring price. The Appendix 
contains simulation results under the more unrealistic assumption of 
isoelastic marginal cost functions. These simulations indicate the ex- 
ceptional nature of the conditions when equation (6) is not satisfied. 
When a quality-assuring price does not exist, the cost functions are 
generally such that at reasonable premiums cheating output would be 
expanded by very large factors (often factors of many thousands). 
Since marginal cost functions for most products can be expected to 
become quite steep if not vertical at output expansions of much less 
than these factors, a quality-assuring price premium can generally be 
expected to exist. 

Throughout the remainder of the paper we assume the existence of 
a quality-assuring price. For given cost and demand conditions, the 
minimum quality-assuring price will depend upon the level of quality 


10 The potential function of price premiums as quality guarantors is also applicable to 
markets in which firms face downward-sloping demands. In this case, the inability of 
firms to increase sales without reductions in price limits the gains available from 
deceptive minimum quality production as price increases. The existence of a price 
sufficient to guarantee quality now depends on the elasticity of demand in addition to 
the cost savings from quality reductions at various quantities. In addition, when price- 
searching firms do not have stable future demands, consumer knowledge of cost and 
current demand conditions is not sufficient to estimate the quality-assuring price. The 
anticipated future demand vis-a-vis current demand is also relevant. For example, 
where consumers expect a growing demand for the output of a firm that continues to 
produce high quality output, the rate of quasi-rent flow from high quality (or future 
deceptive minimum quality) production increases over time. As compared to a firm 
with the same initial but constant demand, the growing firm will receive a larger capital 
value return at any price from high quality production in the initial period. Firms 
facing expected demand growth will therefore require smaller quality-assuring price 
premiums. See Klein, McLaughlin, and Murphy (1980) for an analysis of the less than 
perfectly elastic firm demand case. 
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considered and is denoted by P* = P*(q, qmin, 7). Our analysis implies 
that the quality-assuring price will increase as quality increases, as 
minimum quality decreases (for all q greater than qmin), and as the 
interest rate increases. These conditions are consistent with the 
familiar recognition that, given a particular quality level, quality- 
cheating problems are less severe the higher the level of quality that 
can be detected prepurchase and the shorter the period of repur- 
chase.!! 

Intuitively, the quality-assuring price treats the potential value of 
not producing minimum quality as an explicit opportunity cost to the 
firm of higher quality production. Hence the quality-assuring price 
must not only compensate the firm for the increased average produc- 
tion costs incurred when quality above that detectable prior to pur- 
chase is produced, but must also yield a normal rate of return on the 
forgone gains from exploiting consumer ignorance. This price “pre- 
mium” stream can be thought of as “protection money” paid by 
consumers to induce contract performance. Although the present 
discounted value of this stream equals the value of the short-run gain 
the firm can obtain by cheating, consumers are not indifferent be- 
tween paying the “premium” over time or permitting the firm to 
cheat. The price “premium” is a payment for high quality in the face 
of prepurchase quality-determination costs. The relevant consumer 
choice is between demanding minimum quality output at a perfectly 
competitive (costless information) price or paying a competitive price 


"' We can complicate our model by dropping the assumption that nondeceiving firms 
are anticipated to produce forever. If firms have a finite life, and the last period of 
production is known by both firms and consumers, there will be no premium sufficient 
to guarantee quality. No matter how high the premium paid by consumers for a high 
quality good in the last period, firms will supply “deceptive” minimum quality because 
there are no future sales to lose. Consumers aware of the last period will therefore 
demand only the minimum quality in that period. But then the next to the last period 
becomes the last period in the sense that, independent of the price premium, firm 
wealth is maximized by supplying minimum quality and going out of business. Con- 
sumers will then only pay for minimum quality output in the next to last period, and so 
on. High quality will never be produced. However, the necessary unraveling of the 
premium solution to assure high quality requires prior consumer knowledge of a date 
beyond which the firm will not produce. If consumers merely know that firms have 
finite lives but cannot with certainty specify a date beyond which a particular firm will 
not exist, price premiums may assure quality. While consumers are aware that some 
transactions will be with firms in their last period and hence cheating will occur, the 
expected gain from purchasing high promised quality can be positive. Our price 
premium—repeat business quality enforcement mechanism is analytically equivalent in 
form to the “super-game” solutions to the prisoner’s dilemma problem developed in the 
game-theory literature. A general result of that analysis is that a cooperative solution 
can exist if one assumes either an infinitely long super game (as we have assumed in our 
model), or a super game of finite length but with transactors who have sufficient 
uncertainty regarding the period when the super game will end (see, e.g., Luce and 
Raiffa [1957], pp. 97-102, or, for a more recent solution to the problem that is similar 
in emphasis to our approach, Telser [1980]). 
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“premium,” which is both necessary and sufficient, for higher quality 
output.” 

There is a possibility that the required quality-guaranteeing price 
premium may exceed the increased consumer surplus of purchasing 
higher quality rather than the minimum quality product. If consum- 
ers can easily substitute increased quantity of the low quality product 
for increased quality, then the value of guaranteed high quality will be 
relatively low. Therefore, although a quality-guaranteeing price 
exists, a higher than minimum quality product may not be produced. 
For those goods where the substitution possibilities between quality 
and quantity are lower (e.g., drugs), consumer demand for confi- 
dence will be relatively high and the high quality guarantee worth the 
price premium. We assume throughout that we are dealing with 
products where some demand exists for the high quality good in the 
range of prices considered. 


III. Competitive Market Equilibrium: Firm-specific Capital 
Investments 


Our analysis has focused on the case where costless information 
(perfectly competitive) prices do not imply sufficient firm-specific 
rents to motivate high quality production. A price premium was 
therefore necessary to induce high quality supply. Thus, if price 
assures quality, the firms producing quality greater than qmin appear 
to earn positive economic profits. However, this cannot describe a full 
market equilibrium. When the price is high enough to assure a par- 
ticular high level of quality, additional firms have an incentive to enter 
the industry. But if additional firms enter, the summation of the 
individual firms’ outputs will exceed the quantity demanded at the 
quality-assuring price. Yet this output surplus cannot result in price 
reductions since the quality-assuring price is, in effect, a minimum 
price constraint “enforced” by rational consumers. All price-taking 
firms supplying a particular promised quality q above the minimum 
face a horizontal demand curve at P* = P*(q), which is of an unusual 
nature in that prices above or below P* result in zero sales. Consumers 
know that any price below P* for its associated quality results in the 


12 As opposed to the Darby and Karni (1973) analysis, this particular model implies 
an equilibrium quantity of “fraud” equal to zero, where fraud is the difference between 
anticipated and actual quality. Given the symmetrical information assumptions re- 
garding cost functions, parties to a contract know when and by how much a contract will 
be broken. An unanticipated broken quality contract is therefore not possible. The 
implicit economic (as opposed to common usage) concept of “contract” refers to 
anticipated outcomes and not to verbal promises or written agreements; thus there will 
be no broken quality “contracts.” 
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supply of qmin. They therefore will not purchase from a firm promis- 
ing that quality at a price lower than P*. 


A. Brand Name Capital Investments 


Competition to dissipate the economic profits being earned by exist- 
ing firms must therefore occur in nonprice dimensions. However, the 
zero-profit equilibrium is consistent with only a very particular form 
of profit-absorbing nonprice competition. The competition involves 
firm-specific capital expenditures. This firm-specific capital competition 
motivates firms to purchase assets with (nonsalvageable) costs equal to 
the capital value of the premium rental stream earned when high 
quality is supplied at the quality-assuring price. That is, if P*(q) is not 
to increase, the investment leading to zero profits must be highly firm 
specific and depreciate to zero if the firm cheats and supplies qmin 
rather than the anticipated quality. Such firm-specific capital costs 
could, for example, take the form of sunk investments in the design of 
a firm logo or an expensive sign promoting the firm’s name. Ex- 
penditures on these highly firm-specific assets are then said to repre- 
sent brand name (or selling) capital investments. 

The competitive process also forces the firm-specific capital invest- 
ments to take the form of assets which provide the greatest direct 
service value to consumers. The consumers’ “effective” price of pur- 
chasing a quality-assured good, where the effective price is defined as 
the purchase price of a product, P*, less the value of the services 
yielded by the jointly supplied brand name assets, is thereby 
minimized. Competition among firms in seeking and making the most 
highly valued firm-specific brand name capital investments will occur 
until the expected wealth increase and, therefore, the incentive to 
enter the industry are eliminated. 

If the firm decides to cheat it will experience a capital loss equal to 
its anticipated future profit stream. Since P*(q) is derived so that the 
threat of loss of this future profit stream motivates guaranteed quality, 
the zero-profit equilibrium “brand-name capital,” B, which serves as 


an explicit hostage to prevent cheating, equals, in terms of figure 1, 
where P*(q,) = Po, 


[Ps = (AC )olxe_ 


r 


B= (7) 
That is, the market value of the competitive firm’s brand name capital 
is equal to the value of total specific or “sunk” selling costs made by the 
firm which, in turn, equals the present value of the anticipated pre- 
mium stream from high quality output. If we continue to assume that 
there are no capital (and therefore “sunk”) costs of production, the 
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zero-profit equilibrium is shown in figure 1 where average “total” cost 
(which includes average production costs, AC,,, plus average brand- 
name capital [i.e., nonsalvageable “selling”] costs, r[B/x]) just equals 
price, Py». 

What assures high quality supply is the capital loss due to the loss of 
future business if low quality is produced. Since the imputed value of 
the firm’s brand name capital is determined by the firm’s expected 
quasi rents on future sales, this capital loss from supplying quality 
lower than promised is represented by the depreciation of this firm- 
specific asset. The expenditures on brand name capital assets are 
therefore similar to collateral that the firm loses if it supplies output of 
less than anticipated quality and in equilibrium the premium stream 
provides only a normal rate of return on this collateral asset. 

Note that the “effective” price paid by consumers, which equals the 
quality-assuring price less the value of the consumer services yielded 
by the brand name capital, may be minimized by the investment in 
specific selling assets with some positive salvage value. Even though 
this results in an increased quality-guaranteeing price, assets with 
positive salvage values may yield differentially large direct consumer 
service flows. All brand name capital assets must, however, satisfy a 
necessary condition that the salvage value per unit of output be less 
than the consumer service value. Firms competing to minimize the 
effective price will choose specific assets by trading off increased 
consumer service value with decreased salvage value. This may ex- 
plain why stores which supply high quality products often have 
amenities (such as a luxurious carpet cut to fit the particular store) 
even though only small direct consumer services may be yielded 
relative to cost.” 


B. Nonsalvageable Productive Assets 


The market equilibrium we have developed implies an effective price 
for high quality output that is higher than what would exist in a zero 
information cost world. While the costless-information solution 1S 
meaningless as an achievable standard of “efficiency,” alternative 
marketing arrangements may be usefully compared to this bench- 
mark. Viable, competitive firms will adopt the arrangements which, 
considering all transacting and contracting costs, minimize the devia- 


13 Tf the “sunk” asset yields absolutely no consumer services, then the firm will not use 
it. Even though profits would be eliminated by purchase of such an asset, consumers 
would be indifferent between a firm that invested in the asset and a firm that did not. In 
a world where consumers do not possess full knowledge of cost conditions, however, 
use of obviously specific assets may be employed even if yielding no direct consumer 
service flow because they may efficiently inform consumers regarding the sunk capital 
cost to the firm. This is discussed in greater detail in Sec. IIIC. 
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tions between the costless-information price and the effective price. 
One potentially efficient alternative or supplement to the pure price- 
premium method of guaranteeing quality may be the use of nonsal- 
vageable productive assets rather than brand name (selling) assets. 

In order to simplify the analysis of price premiums in guaranteeing 
quality, we have assumed that all production costs, including fixed 
costs, were noncapital costs and therefore, by definition, salvageable. 
More realistically, firms can control both the capital intensity of pro- 
duction and the salvage value of any fixed assets employed in the 
production process. In particular, if the firm uses a production pro- 
cess that has a nonsalvageable capital element, the normal rate of 
return (quasi-rent stream) on this element of production capital ef- 
fectively serves as a quality-assuring premium. In terms of our model, 
the capital value of the quasi-rent stream when a firm cheats (eq. [3]) is 
now modified so that the net gain from cheating equals W; minus this 
nonsalvageable capital cost. Alternatively, in the zero-profit equilib- 
rium the total level of collateral must still equal the potential gross 
gains from cheating, but part of the collateral is now provided by the 
nonsalvageable production assets rather than the brand name capital 
assets. 

For example, if a machine is somewhat illiquid, buying it rather 
than renting it short term provides some of this collateral and lowers 
the quality-guaranteeing price. In fact, because of positive selling 
costs, capital assets generally have a salvage value less than cost. Thus 
capital inputs, especially those that have a high turnover cost, will 
have a value in terms of providing quality assurance in addition to 
their productive value. Even if the asset is not firm specific, if there is 
any time delay after the firm cheats and is terminated by consumers in 
selling the asset to another firm, the firm loses the real rate of interest 
for that time period on the capital. In addition to physical capital, 
human capital costs, especially entrepreneurial skills, are also often 
highly nonsalvageable in the face of consumer termination and there- 
fore also provide significant quality assurance. 

The general theoretical point is that the presence of positive 
quality-information costs favors an increase in the capital intensity of 
production, including the extent of long-term, illiquid contractual 
arrangements with suppliers of productive inputs. In particular, the 
minimum-cost production technique is no longer necessarily that 
which minimizes solely the average cost of production. “Sunk” pro- 
duction capital now accomplishes two functions—the supply of pro- 
duction services and the supply of quality-assuring services. Therefore, 
increases in average production costs accompanied by larger increases 
in sunk production assets may minimize the effective consumer prod- 
uct price. Profit maximization requires firms to trade off “inefficient” 
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production technologies and the quality-assurance cost savings im- 
plied by the presence of firm-specific (sunk) capital assets in the 
productive process and hence the reduced necessity for the firm to 
make sunk selling cost (brand name capital) investments. Although 
the more capital intensive production technology may increase the 
perfectly competitive price of high quality output, Po, it reduces the 
price premium, P, — P,, necessary to assure the supply of that high 
quality. In fact, even a very slight modification of the minimum 
production cost technology, such as an alteration in some contractual 
terms, may imply the existence of large enough nonsalvageable assets 
so that the need for a quality-guaranteeing price premium is elimi- 
nated entirely.* 


C. Consumer Cost Uncertainty: A Role for Advertising 


The discussion to this point has assumed complete consumer knowl- 
edge of firms’ costs of producing alternative quality outputs and 
knowledge of the extent to which any capital production costs or 
brand name capital selling costs are salvageable. This knowledge is 
necessary and sufficient to accurately calculate both the quality- 
guaranteeing premium and price. However, consumers are generally 
uncertain about cost conditions and therefore do not know the 
minimum quality-guaranteeing price with perfect accuracy. In fact, 
consumers cannot even make perfect anticipated quality rankings 
across firms on the basis of price. That one firm has a higher price 
than another may indicate a larger price premium or, alternatively, 
more inefficient production. In this section, we examine how the 
more realistic assumption of consumer cost uncertainty influences 
market responses to prepurchase quality uncertainty. 

We have shown that increases in the price premium over average 
recoverable cost generally increase the relative returns from produc- 
tion of promised (high) quality rather than deceptive minimum (low) 
quality. The existence of a high price premium also makes ex- 
penditures on brand name capital investments economically feasible. 
The magnitude of brand name capital investments in turn indicates 


4 For example, franchisers can assure quality by requiring franchisee investment in 
specific production capital. A general arrangement by which this is accomplished is by 
not permitting the franchisee to own the land upon which its investments (e.g., capital 
fixtures) are made. Rather, the franchiser owns or leases the land and leases or 
subleases it to the franchisee, thereby creating for the franchisee a large nonsalvageable 
asset if he is terminated by the franchiser. This highly franchiser-specific asset can 
therefore serve as a form of collateral and potentially eliminate any need for a price 
premium. See Klein (1980) for a more complete discussion of this franchising solution, 
including the potential reverse cheating problem that is created by such contractual 
arrangements. 
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the magnitude of the price premium. When a consumer is uncertain 
about the cost of producing a particular high quality level of output 
and therefore the required quality-assuring premium, information 
about the actual level of the price premium will provide information 
about the probability of receiving high quality. If consumers are risk 
averse, this uncertainty about receiving anticipated high or decep- 
tively low quality output will increase the premium that will be paid. 
The premium will include both a (presumably unbiased) estimate of 
the quality-assuring premium and an extra payment to reduce the 
risk of being deceived. 

Thus, when consumers do not know the minimum quality- 
guaranteeing price, the larger is a firm’s brand name capital invest- 
ment relative to sales, the more likely its price premium is sufficient to 
motivate high quality production. Competitive investment in brand 
name capital is now no longer constrained to assets which yield direct 
consumer service flows with a present discounted value greater than 
the salvage value of the assets. Implicit information about the suffi- 
ciency of price as a guarantee can be supplied by “conspicuous” 
specific asset expenditures. Luxurious storefronts and ornate displays 
or signs may be supplied by a firm even if yielding no direct consumer 
service flows. Such firm-specific assets inform consumers of the mag- 
nitude of sunk capital costs and thereby supply information about the 
quasi-rent price-premium stream being earned by the firm and hence 
the opportunity cost to the firm if it cheats. Both the informational 
services and the direct utility producing services of assets are now 
relevant considerations for a firm in deciding upon the most valuable 
form the brand name capital investment should take. 

The value of information about the magnitude of a firm’s specific 
or “sunk” capital cost, and therefore the magnitude of the price 
premium, is one return from advertising. Indeed, the role of pre- 
miums as quality guarantors provides foundation for Nelson’s (1974) 
argument that advertising, by definition, supplies valuable informa- 
tion to consumers—namely, information that the firm is advertising. 
A sufficient investment in advertising implies that a firm will not 
engage in short-run quality deception since the advertising indicates a 
nonsalvageable cost gap between price and production costs, that is, 
the existence of a price premium. This argument essentially reverses 
Nelson’s logic. It is not that it pays a firm with a “best buy” to advertise 
more, but rather that advertising implies the supply of “best buys,” or 
more correctly, the supply of promised high quality products. Adver- 
tising does not directly “signal” the presence of a “best buy,” but 
“signals” the presence of firm-specific selling costs and therefore the 
magnitude of the price premium. We would therefore expect, ceteris 
paribus, a positive correlation not between advertising intensity and 
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“best buys,” as Nelson claims, but between advertising intensity and 
the extent of quality that is costly to determine prepurchase.” 

Conspicuous sunk costs such as advertising are, like all sunk costs, 
irrelevant in determining future firm behavior regarding output 
quality. However, consumers know that such sunk costs can be 
profitable only if the future quasi rents are large. In particular, if the 
consumer estimate of the initial sunk expenditure made by the firm is 
greater than the consumer estimate of the firm’s possible short-run 
cheating gain, then a price premium on future sales sufficient to 
prevent cheating is estimated to exist. Our analysis therefore implies 
that independent of excludability or collection costs, advertising that 
guarantees quality will be sold at a zero price and “tied in” with the 
marked-up product being advertised.'® 

Our theory also suggests why endorsements by celebrities and other 
seemingly “noninformative” advertising such as elaborate (obviously 
costly to produce) commercials, sponsorships of telethons, athletic 
events, and charities are valuable to consumers. In addition to draw- 
ing attention to the product, such advertising indicates the presence 
of a large sunk “selling” cost and the existence of a price premium. 
And because the crucial variable is the consumers’ estimate of the 
stock of advertising capital (and not the flow), it also explains why 
firms advertise that they have advertised in the past (e.g., “as seen on 
‘The Tonight Show’”). Rather than serving a direct certifying func- 
tion (e.g., as recommended by Good Housekeeping magazine), informa- 
tion about past advertising informs consumers about the magnitude 
of the total brand name capital investment.” 

Firms may also provide valuable information by publicizing the 


18 Nelson’s argument is based on an assumption similar to the Spence (1973)-type 
screening assumption regarding the lower cost to more productive individuals of 
obtaining education. Nelson’s argument, however, is circular since consumers react to 
advertising only because the best buys advertise more and the best buys advertise more 
only because consumers buy advertised products. Schmalensee (1978) has shown that 
the Nelson scenario may imply “fly-by-night” producers who advertise the most and 
also deceptively produce minimum quality. Like Spence’s signaling model, the govern- 
ment could, in principle, tax this investment and thereby save real resources without 
reducing the effectiveness of this information if consumers were aware of the tax rate. 
However, advertising serves many purposes. In particular, advertising also can supply 
valuable consumer information about the particular characteristics and availability of 
products. For optimality the government would therefore have to determine the 
appropriate tax rate for each advertising message and consumers would have to be 
aware of each of these particular tax rates. 

16 Mishan (1970) has argued for legislation which would require advertising to be sold 
separately at a price which covers advertising costs. This would completely destroy the 
informational value of advertising we are emphasizing here. 

17 Note, however, that just as firms may deceive consumers about quality to be 
supplied, they may also attempt to deceive them about the magnitude of the advertising 
investments made, e.g., purchasing a local spot on “The Tonight Show” and advertising 
the advertising as if an expenditure on a national spot was made. 
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large fees paid to celebrities for commercials. Information about large 
endorsement fees would be closely guarded if the purpose were to 
simulate an “unsolicited endorsement” of the product’s particular 
quality characteristics rather than to indicate the existence of a price 
premium. Viewed in this context, it is obviously unnecessary for the 
celebrity to actually use the particular brand advertised. This is con- 
trary to a recent FTC ruling (see Federal Trade Commission 1980). 

This analysis of advertising implies that consumers necessarily re- 
ceive something when they pay a higher price for an advertised 
brand. An expensive name brand aspirin, for example, is likely to be 
better than unadvertised aspirin because it is expensive. The adver- 
tising of the name brand product indicates the presence of a current 
and future price premium. This premium on future sales is the firm’s 
brand name capital which will be lost if the firm supplies lower than 
anticipated quality. Therefore, firms selling more highly advertised, 
higher priced products will necessarily take more precautions in 
production.'® 

We have emphasized the informational value of advertising as a 
sunk cost. Other marketing activities can serve a similar informational 
role in indicating the presence of a price premium. For example, free 
samples, in addition to letting consumers try the product, provide 
information regarding future premiums and therefore anticipated 
quality. Such free or low-price samples thus provide information not 
solely to those consumers that receive the samples but also to anyone 
aware of the existence and magnitude of the free or low-price sample 
program. More generally, the supply by a firm of quality greater than 
anticipated and paid for by consumers is a similar type of brand name 
capital investment by the firm. By forgoing revenue, the firm provides 
information to consumers that it has made a nonsalvageable invest- 
ment of a particular magnitude and that a particular future premium 
stream is anticipated to cover the initial sunk alternative cost.!9 


'** The greater is the cost to consumers of obtaining deceptively low quality, the 
greater will be the demand for quality assurance. The very low market share of 
“generic” children’s aspirin (1 percent) vis-a-vis generic’s share of the regular aspirin 
market (7 percent) is consistent with this implication (see IMS America, Ltd. 1978). 
Many individuals who claim “all aspirin is alike” apparently pay the extra price for their 
children where the costs of lower quality are greater and therefore quality assurance is 
considered more important. 

Our analysis of advertising also illuminates the monopolistic competition debate. 
Chamberlin’s (1965) distinction between production costs, defined as what is included 
in the “package” that passes from seller to buyer, and selling costs (e.g., advertising), 
which are not part of the package transferred, suggests that selling costs usefully may 
be considered as a privately supplied collective factor. For example, a firm which holds 
selling costs, such as expenditures on a store sign, constant as his sales increase does not 
appear to be decreasing the average “quality” of his product. Demsetz (1959, 1968) 
made the contrary assumption that average quality does fall as sales increase, holding 
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Finally, even when consumers systematically underestimate the 
quality-assuring price because of downward-biased estimates of 
production or marketing costs or upward-biased estimates of antici- 
pated demand growth, firms in a monopolistically competitive envi- 
ronment may not cheat. Such price-setting firms may possess specific 
nonsalvageable assets (such as trademarks) upon which they are 
earning a sufficient quasi-rent premium to induce high quality sup- 
ply. However, the existence of independent competitive retailers that 
do not have any ownership stake in this firm-specific asset and yet can 
significantly influence the quality of the final product supplied to 
consumers creates a severe quality-cheating problem for the manu- 
facturer. In this context, rational but imperfectly informed consum- 
ers will not demand a sufficient premium to prevent retailer cheating. 
Manufacturers may protect their trademarks by imposing constraints 
on the retailer competitive process including entry restrictions, exclu- 
sive territorial grants, minimum resale price maintenance, and adver- 
tising restrictions that will assure quality by creating a sufficiently 
valuable premium stream for the retailers. If this manufacturer- 
created premium stream is greater than the potential short-run re- 
tailer return from deceptive low quality supply, the magnitude of 
which is determined in part by the manufacturer by its level of 
direct policing expenditures, the retailer will not cheat and the 
consumer will receive anticipated high quality supply.” 


IV. Conclusion 


We have shown that even the existence of perfect communication 
among buyers so that all future sales are lost to a cheating firm is not 
sufficient to assure noncheating behavior. We have analyzed the 


selling costs constant, by merely ignoring Chamberlin’s distinction and its possible 
theoretical significance and identifying quality costs with selling costs (aggregating both 
into the concept “demand increasing costs”). However, since in a monopolistically 
competitive environment the price premium that will assure quality depends upon the 
demand expected in the future, the quality incentive implied by an advertising invest- 
ment also depends upon consumers’ expectations about future demand. In particular, 
the relevant variable indicating an incentive to produce high quality is the level of 
advertising capital compared to anticipated future sales. Hence advertising is not a pure 
public good in a firm’s production function as Chamberlin implicitly assumed, and the 
arbitrary contrary assumption made by Demsetz is possibly justifiable. 

20 See Klein et al. (1980) for a complete analysis of this case applied to the FTC Coors 
litigation. Coors appears to have employed exclusive territories on the wholesale level 
and resale price maintenance on the retail level to create a sufficient premium to 
encourage the necessary refrigeration of their nonpasteurized beer. Implications of this 
analysis in terms of providing a possible rationale for similar constraints on the compe- 
titive process enforced by trade associations and government regulatory agencies are 
also examined. 
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generally unrecognized importance of increased market prices and 
nonsalvageable capital as possible methods of making quality prom- 
ises credible. We obviously do not want to claim that consumers 
“know” this theory in the sense that they can verbalize it but only that 
they behave in such a way as if they recognize the forces at work. They 
may, for example, know from past experience that when a particular 
type of investment is present such as advertising they are much less 
likely to be deceived. Therefore, survivorship of crude decision rules 
over time may produce consumer behavior very similar to what would 
be predicted by this model without the existence of explicit “knowl- 
edge” of the forces we have examined. 

Our analysis implies that consumers can successfully use price as an 
indicator of quality. We are not referring to the phenomenon of an 
ignorant consumer free riding on the information contained in the 
market price paid by other more informed buyers but rather to the 
fact that consumer knowledge of a gap between firm price and sal- 
vageable costs, that is, the knowledge of the existence of a price 
premium, supplies quality assurance. The former argument, that a 
naive buyer in a market dominated by knowledgeable buyers can use 
price as a quality signal because the relative market price of different 
products reflects differences in production costs and therefore differ- 
ences in quality, crucially depends upon a “majority” of the buyers in 
the market being knowledgeable. 

As Scitovsky (1945, p. 101) correctly notes, . the situation 
becomes paradoxical when price is the index by which the average 
buyer judges quality. In a market where this happens price ceases to 
be governed by competition and becomes instead an instrument 
wherewith the seller can influence his customer’s opinions of the 
quality of his wares.” However, even when the “average” buyer uses 
price as an index of quality, we need not fear, as Scitovsky does, the 
havoc this supposedly wreaks on the economic theory of choice. All 
consumers in a market may consistently use price, given their esti- 
mates of salvageable production costs, as an indicator of the firm’s 
price-premium stream and therefore as an indicator of the antici- 
pated quality of the output to be supplied by the firm. Scitovsky did 
not consider that price not only influences buyers’ expectations but 
also influences producers’ incentives. 

We do not wish to suggest that use of implicit (price premium— 
specific investment) contracts is always the cheapest way to assure 
quality supply. When quality characteristics can be specified cheaply 
and measured by a third party, and hence contract enforcement costs 
are anticipated to be low, explicit contractual solutions with gov- 
ernmentally enforced penalties (including warranties) may be a less 


““ 
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costly solution. When explicit contract costs are high and the extent of 
short-run profit from deceptively low quality supply and hence the 
quality-assuring price premium is also high, governmental specifica- 
tion and enforcement of minimum quality standards may be an alter- 
native method of reducing the costs of assuring the supply of high 
quality products.?’ And, finally, vertical integration, which in this 
consumer-product context may consist of home production or con- 
sumer cooperatives, may be a possible alternative arrangement (see 
Klein, Crawford, and Alchian 1978). 

The three major methods in which to organize transactions can be 
usefully considered within this framework as (a) explicit contractual 
or regulatory specification with third-party enforcement, ()) direct 
(two-party) enforcement of implicit contracts, and (c) one-party or- 
ganization or vertical integration. This paper has analyzed the brand 
name repeat-purchase mechanism represented by the second alter- 
native. More generally, however, all market transactions, including 
those “within” the firm such as employer-employee agreements, con- 
sist of a combination of the two basic forms of contractual arrange- 
ments. Some elements of performance will be specified and enforced 
by third-party sanctions and other elements enforced without invok- 
ing the power of some outside party to the transaction but merely by 
the threat of termination of the transactional relationship. 

Our analysis implies that, given a particular level of explicit contract 
costs, we are more likely to observe an increased reliance on the brand 
name contract-enforcement mechanism the lower the rate of interest 
and the lower the level of prepurchase quality-determination costs. 
The lower the interest rate the greater the capital cost to a firm from 
the loss of future sales and therefore the lower the equilibrium price 
premium. Hence we can expect the termination of future exchange 
method of enforcing contracts to be more effective. More generally, 
since the interest rate in our model refers to the period of product 
repurchase, the quality assurance will be less costly for less durable 
goods that have greater repurchase frequency. Franchising chains, 
for example, take advantage of this effect by making it possible for 


21 Such governmental regulations, however, do not avoid the contractual problems of 
ex ante explicitly defining in an enforceable manner all major elements of perfor- 
mance. Nor do they necessarily avoid the implicit contractual conditions of a price- 
premium stream (created by entry restrictions, an initial forfeitable bond, and/or 
minimum price restraints) to effectively enforce the governmental regulations (see 
Klein et al. 1980). In addition, by making it illegal to supply less than the regulated 
quality, individuals that would voluntarily demand lower quality than the regulated 
standard incur a loss of consumer surplus. Distribution effects are created, since while 
the regulation may decrease the cost of supplying high quality output it increases the 


cost of supplying lower quality output. 
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consumers to pool information from sales of seemingly disparate 
sellers, thereby decreasing the period of repurchase and the quality- 
assuring price. 

Similarly, purchase from a diversified firm increases the frequency 
of repeat purchase and lowers the necessary price premium. As long 
as consumers react to receiving unexpectedly low quality from a 
diversified firm by reducing purchases of the firm’s entire product 
line, all the firm’s nonsalvageable capital serves to assure the quality of 
each product it produces. This economy of scale in communicating 
quality-assurance information to consumers may be one motivation 
for conglomerate mergers. If a firm sells a set of products, each of 
which is produced by capital with salvage value less than costs, the 
quality-guaranteeing price premium on each product will be lower 
than if production were done by separate firms. 

Finally, we can expect greater reliance on the non-third-party 
method of contract enforcement the lower the direct costs to the 
consumer of determining quality of the product prepurchase. The 
higher the costs of producing the minimum quality output that can- 
not be distinguished prepurchase from a given promised high quality 
output and the faster these minimum quality production costs rise 
with increased output, the lower the potential short-run cheating gain 
and therefore the lower the price premium. When the low quality cost 
function is such that a cheating firm can expand output a substantial 
amount with little increase in cost, use of the brand name enforce- 
ment mechanism is unlikely. 

When the low quality cost function becomes so flat that the pre- 
mium solution does not exist, the implicit contract-enforcement 
mechanism we have analyzed will not be used. When this condition is 
combined with an extremely high cost of quality assurance via explicit 
contractual guarantees, governmental supply may be the cheapest 
alternative. An obvious example is the good “money,” where the 
marginal cost of production is essentially zero, the short-run cheating 
potential extremely large, and where the cost of a commodity money 
or the necessary bullion reserves to assure performance via converti- 
bility is also extremely high. Governmental supply is the generally 
adopted but far from costless solution (see Klein 1974). Other prod- 
ucts where the “hold-up” potential is very large and where explicit 
contract costs are high (such as police or fire protection services) are 
also generally supplied by non-profit-maximizing government agen- 
cies rather than by unregulated profit-maximizing firms earning large 
quasi rents on unsalvageable (firm-specific) capital assets. In general, 
minimization of the cost of assuring performance will imply an opti- 
mal combination of governmental regulation and/or supply, explicit 
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contractual enforcement, vertical integration, and the implicit (brand 
name) contractual enforcement mechanism we have analyzed. 


Appendix 
Simulation of the Quality-assuring Price 


If we assume that output of high and low (minimum) quality is produced by 
constant-elasticity cost functions of the form: 


Cy = Fr + Brxh (Al) 
C, = Fi + Bixt, (A2) 
the quality-assuring price premium, P, will be given by:” 
(a-D/a 
[2 as Fay | 
m WS 
CaP a A ema re (A3) 
1 _ r jb 1/(a—1) 
aes Bn 


This expression indicates that as the ratio of low to high quality fixed costs, 
(F,/F,), decreases the quality-guaranteeing price-premium increases (because 
the short-run profit from cheating increases). But, as long as F, can be 
assumed to be less than or equal to Fp, fixed costs cannot affect the existence 
of the quality-guaranteeing price premium. Similarly, as the interest rate, r, 
increases, the quality-guaranteeing price premium increases but will always 
exist. It is the marginal cost elasticity, [1/(a — 1)], and the ratio of the marginal 
cost slopes, (;/Bn), that determine the existence of a quality-assuring price 
premium. An increase in the elasticity of marginal cost or a decrease in the 
ratio of the low to high quality marginal cost slopes, by increasing the possible 
expansion of the low quality output at the high quality-guaranteeing price, 
increases the quality-guaranteeing price premium and the likelihood that it 
may not exist. Simulation results as a function of these parameters are pre- 
sented in table Al below. The ratio of the quality-assuring price to the 
minimum average cost of high quality production, P,/P in terms of figure 1, 
along with the ratio of low quality output at the quality-assuring price relative 
to the minimum high quality average cost output, x4/x, in terms of figure 1, is 
presented. When the quality-assuring price does not exist, the ratio of low 
quality output at a P,/P, ratio of 2 to the output at the minimum average cost 
of low quality output, xo, is presented in brackets to indicate the shape of the 
low quality cost function. The results indicate that these cases of nonexistence 
generally occur where the low quality cost curve is so flat relative to the high 
quality cost curve that cheating output can be expanded dramatically relative 
to the noncheating output. For example, when the marginal cost elasticity is 
assumed to be 10.0 and the ratio of marginal cost slopes is assumed to be 0.25, 
it implies that low quality output can be profitably expanded by more than a 
billion times beyond its minimum average cost rate when the market price is 
double the perfectly competitive high quality price. 


22 The derivation is available to readers upon request. 
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The Political Economy of Benefits and Costs: A 
Neoclassical Approach to Distributive Politics 





Barry R. Weingast, Kenneth A. Shepsle, and 


Christopher Johnsen 
Washington University 


This essay offers a rational political explanation for the notorious 
inefficiency of pork barrel projects with an optimization model of 
legislative behavior and legislative institutions. The model em- 
phasizes the (economically arbitrary, from a welfare point of view) 
importance of the geographic incidence of benefits and costs owing 
to the geographic basis for political representation. We explore the 
implications of a legislator’s objective function and derive conditions 
under which a representative legislature will select an omnibus of 
projects each of which exceeds the efficient scale. 


The inefficiency of public decision making has long been a concern 
for students of public policy. Models of bureaucratic behavior, legis- 
lative institutions, interest-group influence, vote-maximizing politi- 
cians, and fiscal illusion all point toward sources of bias in these 
nonmarket contexts.! Yet none has established why a cooperative 
legislature would stand for policies which are Pareto dominated. 
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Ferejohn and Fiorina 1975; Wagner 1976; Aranson and Ordeshook 1978: and Fiorina 
and Noll 1978. 
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Specifically, if the bias works to the benefit of some identifiable group, 
then why do political actors not insist on efficiency in combination 
with a compensation scheme? This would leave the identified group at 
least as well off, all others at least as well off, and then leave the 
efficiency gain at the political discretion of representatives, thereby 
enhancing their role. From the anomaly of sustained political ineffi- 
ciency, it appears that there is a divergence between normative eco- 
nomic principles, on the one hand, and the preferences of political 
actors on the other. Prominence is given in the following analysis to 
the political mechanisms that create and maintain this divergence. 

In this paper, we develop a model of the public choice mechanisms 
comprising a representative legislature in order to show the political 
sources that systematically bias public decisions toward larger than 
efficient projects in the area of distributive policymaking. By distribu- 
tive policies we mean those projects, programs, and grants that con- 
centrate the benefits in geographically specific constituencies, while 
spreading their costs across all constituencies through generalized 
taxation2 This collection of public decisions includes the traditional 
pork barrel of public works and rivers and harbors projects as well as 
the more recent examples of highway construction, categorical 
grants-in-aid, urban renewal, mass transit, and sewage treatment 
plants. The model identifies the political sources of efficiency bias— 
and there are several—by unpacking the democratic institutions into 
their components, thereby focusing on the influence of each. This 
takes the form of a progression of models beginning with an effi- 
ciency benchmark. Then, one by one, political features are added 
until the final form models a representative legislature divided into n 
districts. The approach shows how political institutions transform the 
economic basis of costs and benefits into political costs and benefits. 
The latter, and not their economic counterparts, define rational deci- 
sions for political actors. 

The model reveals three important sources of bias. The first is a 
consequence of the political definition of benefits and costs and its 
divergence, in important respects, from the economic definition. The 
second source stems from the districting mechanism which divides the 
economy into n disjoint political units called districts. The method of 
project financing through generalized taxation constitutes the third 
source of bias. Moreover, we show that the mechanism of popular 
election of legislative representatives complements these sources of 
bias so that these three sources, in conjunction with the reelection 
mechanism, explain the inefficiency of political choice. 


2 See Lowi (1964) for a discussion of different kinds of policies based on the charac- 
teristics of beneficiaries as well as the mechanisms of financing. 
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The General Approach 


Our model is one of policymaking in the realm of distributive policy, 
so we begin by clarifying that concept. A distributive policy is a 
political decision that concentrates benefits in a specific geographic 
constituency and finances expenditures through generalized taxation. 
These policies—sewage treatment plants, land reclamation, the rivers 
and harbors omnibus, urban renewal projects—authorize collections 
of projects, each targeted to a geographic location and each generat- 
ing benefits in that geographic location unrelated to projects in other 
locations. This latter characteristic is crucial since the omnibus-like 
quality of distributive programs allows decisions to be made on a 
project-by-project, locality-by-locality basis. Each project of the om- 
nibus may be fashioned independent of others in the omnibus. 

While it is clear that all policies have a geographic incidence of 
benefits and costs, what distinguishes a distributive policy is that 
benefits are geographically targeted. In contrast, a nondistributive 
program, say an entitlement program, though having geographic 
incidences, is fashioned with nongeographic constituencies in mind, 
for example, socioeconomic groups. Subsidies to beekeepers, for 
example, generate a distribution of benefits that depends on the 
geographic distribution of beekeepers. No geographic area has a 
claim on program benefits except as it contains residents in the enti- 
tled category. Thus, by our definition, programs targeted to the 
malnourished (food stamps), the unhealthy (Medicare), the poor (wel- 
fare), the retired (social security), the injured worker (workmen’s 
compensation), or the automobile driver (automotive product safety) 
are not distributive policies because any citizen may obtain program 
benefits if he falls in the specific category. Thus, an entitlement 
program confers benefits on all individuals in the designated cate- 
gory, benefits which may not be varied without similarly varying them 
for others in the category. Although the motivation to create nondis- 
tributive programs may have a geographic basis (as when a politician 
is moved to support a policy because many of his constituents fall into 
the beneficiary group), the fact remains that the beneficiary group is 
not geographically defined or determined. In contrast, geography is 
the hallmark of distributive politics: Programs and projects are geo- 
graphically targeted, geographically fashioned, and may be inde- 
pendently varied. Importantly, geography is also the basis for political 
organization and representation. 

A distributive policy for the jth district, P; (x), is a project located in 
that district, where x is a decision parameter. Although x may be 
treated as a vector of project characteristics, we assume that x simply 
describes the scale or size of the project. Associated with the project 
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P;(x) are both benefits and costs. Let (x) represent the present value 
of the economic benefits which flow from the project to the particular 
political constituency. This includes consumption benefits, say cleaner 
water from a sewage treatment plant, and potential pecuniary gains to 
producers, for example, increased profits to project input owners 
from price rises in factor markets.’ 

ASSUMPTION I: b’(x) > 0, b"(x) < 0. 
Let c(x) represent the total resource cost involved in producing the 
project. It decomposes into three components, c(x) = ¢;(x) + ¢2(x) + 
c3(x). The first component, ¢,(x), is the real resource expenditures for 
project inputs spent in the constituency in which the project is located; 
C2(x) is the real resource expenditures for project inputs spent outside 
the district; c;(x) is nonexpenditure real resource costs imposed on 
the district (e.g., nonpecuniary externalities, such as the destruction 
of the natural environment, and pecuniary externalities in the form 
of price rises to consumers in factor markets).’ 

ASSUMPTION 2: c{(x) > 0, cf(x) = 0,7 = 1, 2, 3. 
The expenditures are financed through taxes so that the tax bill for 
P;(x) is 


T(x) = €,(x) + ¢2(x). (1) 


We assume a tax system that covers all expenditures, assigning non- 
negative tax share ¢; to the ith district, where Yj; t; = 1 and n is the 
number of districts. The tax bill for the ith district for the project 
P;(x), therefore, is t;[c,(x) + ¢2(x)]. 

As the above development suggests, there are several mechanisms 
at work in the realm of distributive policy which our model captures. 
First, economic benefits are geographically concentrated in a politi- 
cally relevant way. Second, production costs may be unpacked, again 
in a politically relevant way. Some costs are extracted from the econ- 
omy and returned as geographically earmarked expenditures—c,(x) 
and c)(x); other costs are nonexpenditure in nature, imposed on the 
local economy in which the project is located—c;(x). Third, the tax 
bill, T(x), is paid for by each political subdivision according to the tax 
shares t;,7 = 1,..., 7. The relevant mechanisms which we examine 


3 Some students of cost-benefit analysis, e.g., McKean (1958, chap. 8), argue against 
including pecuniary external effects in the calculations, claiming instead that these 
constitute distributional effects that are not germane for efficiency determinations. The 
treatment of pecuniary external effects remains controversial in cost-benefit analysis. 
Our results reported in this paper do not depend on how this controversy is resolved. 

4 There is a fourth component of costs, namely, nonexpenditure real resource costs 
that spill over into other political constituencies. Since P;(x) is a concentrated project, 
we ignore this for the present. We return to this point in the concluding section. For 
analysis of other kinds of policies not possessing the particular properties of distributive 
projects, see Shepsle and Weingast (1980). 


646 JOURNAL OF POLITICAL ECONOMY 


below are (1) the political cost-accounting mechanism, (2) the dis- 
tricting mechanism, and (3) the taxation mechanism. After examining 
these mechanisms and their effects on the characteristics of distribu- 
tive policy, we explore the complementarity of legislators’ reelection 
objective. First, however, we develop the familiar efficiency criterion. 


Model E: Maximizing Economic Efficiency 


The benchmark for the entire set of political institutions developed 
below is the efficiency criterion. This requires the maximization of 
economic net benefits. This is given simply by 


max E(x) = b(x) — c(x). (2) 
The familiar first- and second-order conditions are 
piece (, 
and (3) 
b” ie cl a O. 


The second-order condition follows directly from assumption 1 and 
assumption 2 so that the solution to (2), x” in figure 1, is a unique 
global maximum. 


The Politicization of Expenditures 


The first transformation of the standard approach is the politicization 
of economic costs. This transformation rests on a crucial political 
property. Project costs, paid from general revenues, become geo- 
graphically earmarked expenditures. The political process distributes 
those expenditures in the form of c,-type and c,-type costs. Thus, for 
P;(x) in district 7, production inputs are purchased from firms and 
individuals in the district (c,) and from their counterparts outside the 
district (c2). More important is the political evaluation of the distribu- 
tion of geographically earmarked expenditures. While these expen- 
ditures are not lump-sum transfers of wealth to factor owners, they do 
entail pecuniary gains since they represent increases in demand for 
project input factors, thereby driving up their prices.> Not only does 


* This statement presumes that factors of production are geographically fixed in the 
short run. This does not preclude the bidding away of pecuniary gains as the long-run 
supply of factors adjusts to this change in demand. For some projects, which are one 
shot and nonrepetitive in nature, the short-run analysis holds since the long-run 
adjustment process is attenuated. For other programs, in which a permanent increase 
in demand has occurred (e.g., the continuing flow of military procurement projects to a 
district), long-run market forces adjust with the concomitant bidding away of pecuniary 
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the public project entail new business for input owners; it allows them 
to receive a higher price for the sale of inframarginal units as well.° As 
this discussion suggests, geographic expenditures are important be- 
cause they distribute pecuniary effects. Consequently, the distribu- 
tional effects of local expenditures combine with consumption 
benefits in the district’s valuation of a project. Put simply, a dollar’s 
valuation of a project may come in either of two forms: a pecuniary 
gain to a factor owner or a benefit to a project consumer. Partly as a 
consequence of these distributional effects, and partly for additional 
reasons enumerated below, the political evaluation of pecuniary ef- 
fects diverges from their economic treatment. 

We may distinguish several classes of agents who are differentially 
advantaged or harmed (in addition to their tax obligations) by the 
provision of P;(x): (1) in-district consumers, who receive benefits 
through consumption of the public project but are unaffected by 
pecuniary effects; (2) in-district factor owners, who obtain pecuniary 
gains in production of, as well as benefits in consumption from, P(x); 
(3) out-of-district factor owners, who obtain pecuniary gains in produc- 
tion (but no consumption benefits since they do not reside in the local 
constituency); (4) in-district consumers who make factor market purchases, 


gains. Nevertheless, the political effect is the same, though now manifested in an 
aversion to pecuniary losses suffered if demand were to contract (i.e., the flow of 


projects cease). 
6 There are pecuniary losers as well—namely, other users of project factors who 


experience rising prices. 
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who obtain consumption benefits from the project but suffer 
pecuniary losses in the form of higher prices for factors; and (5) 
out-of-district purchasers of factors, who suffer pecuniary losses through 
higher prices in factor markets (and who, moreover, obtain no con- 
sumption benefits since they do not reside in the district). Public good 
theorists like Samuelson, Bator, and Head, in their market-failure 
models of public good provision, typically focus only on type 1 agents 
and the nonappropriable product of public projects. Cost-benefit 
studies and welfare analyses (Prest and Turvey 1965; Mishan 1976) 
also focus on type | agents, limiting the role played by the other four 
types. These studies compute costs and benefits in consumption fol- 
lowed by an appropriate adjustment for pecuniary effects. But these 
studies may have missed the point. As Aranson and Ordeshook (1978) 
have emphasized generally, and Stigler (1971) and Peltzman (1976) 
for the case of regulation, it may well be the appropriable pecuniary 
gains and losses of factor owners and competing factor users, respec- 
tively, that drive the political production process. In this latter view, 
the consumption benefits of the public project—clean water, mass 
transit, or whatever—are a by-product of factor owners and factor 
users seeking pecuniary gains and the avoidance of pecuniary losses, 
respectively. 

How these pecuniary effects are distributed geographically, and 
whether they are gains or losses, have different kinds of political 
impacts. Since political representation is geographic, legislators care 
about who gains and who loses in proportion to their geographic 
locations. We assume that local gains and losses are politically more 
significant to the legislator’s objectives (reelection and constituency 
service) than nonlocal effects. Hence representatives use whatever 
legislative influence they can exercise on behalf of those affected 
locally by pecuniary effects. 

Additionally, there are several reasons to believe that pecuniary 
gains are exaggerated and pecuniary losses diminished in the repre- 
sentative’s political calculus. They relate to the concentration of 
pecuniary gains and the dispersion of pecuniary losses. First, in what 
might be termed the “Robert Moses effect” (named after that famous 
New Yorker who appreciated and exploited it so effectively), is the 
observation that pecuniary gains in the form of increased jobs, profits, 
and local tax revenues go to named individuals, firms, and localities 
from whom the legislator may claim credit and exact tribute. 
Pecuniary gains may be targeted to constituents; pecuniary losses, on 
the other hand, are often more widely dispersed, falling on con- 
stituents and nonconstituents alike. Second, pecuniary losses, princi- 
pally through higher prices in factor markets, are not always fully 
linked to the effects of increased factor demand from the project in 
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question. Indeed, the illusion may be such that pecuniary losers are 
unable to distinguish the source of their losses from general price 
inflation. Hence there is a perceptual asymmetry between pecuniary 
gains and losses. Accompanying this asymmetry in perception is an 
asymmetry in capacity to convert perceptions of gains and losses into 
political influence. Third, then, as Peltzman (1976) has noted in 
another context, gainers typically are smaller in number, more cohe- 
sive in political interest, and, consequently, better organized politi- 
cally. They are capable of rewarding the local legislator for delivering 
the bacon in a fashion in which pecuniary losers are unable to punish. 
The combined impact of the Moses, the illusion, and the Peltzman 
effects is an exaggerated political importance accorded local 
pecuniary gainers. The local legislator, then, is strongly encouraged 
to generate projects with large c;-type components (vs. ¢2-type) and 
tends to be less concerned with associated pecuniary losses (vs. 
pecuniary gains). 

Since, on the arguments above, there is political value in securing 
local expenditures for their own sake, the representative, in assessing 
the project P;(x), incorporates, on the benefit side of his political 
calculus, both the consumption benefits his constituents obtain, 6(x), 
and the politically distorted pecuniary effects. Since the latter depend 
on local expenditures (c,[x]), we write it as f[c,(x)]. For the proposi- 
tions below, we stipulate f[c,(x)] = ¢,(x) to enable the clear presenta- 
tion of results unencumbered by mathematical detail; our results are 
qualitatively unaffected so long as f and f’ are positive.’ Thus, the 
representative of the single political constituency seeks a project scale 
that maximizes his political maximand which, in turn, depends on his 
constituency’s benefits minus its share of the costs: [b(x) + ¢(x)] — 
[t;T (x) + ¢3(x)]. 


Model P: A Single Political Constituency 


Consider the case of a single national constituency. Then ¢; = 1 and 
expenditures earmarked for the constituency now consist of all ex- 


7 A straightforward comparative statics analysis, which we do not pursue here, would 
examine how alternative specifications of f affect optimal choices. Such an analysis 
would confirm that as long as/ is positive, the qualitative nature of our results stands. As 
noted, it is not too misleading to equate f[c;(x)] with c,(x) or some linear function 
thereof, namely, f[c,(x)] = Bc:(x), in the relevant range. But only up to a point! If 
expenditures yield not only consumption benefits but 6 dollars per dollar expended of 
additional “benefit,” then, subject only to external costs (cs[x]), legislators would be 
motivated to expend the entire GNP, clearly an absurdity. The function Ffles(x)] is 
meant to represent the political (read: electoral) advantage secured by the legislator 
who delivers c,(x) dollars of public expenditure to his district. Ultmately this advantage 
must tail off so that, in a general mathematical analysis, f [c,(x)] eventually exhibits 
diminishing returns. 
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penditures, that is, c,(x) + (x). With these assumptions and the 
identity in (1), the political objective function reduces to 


max P(x) = Lbs) FP 6.x) i Col) ba (eae ae Gao trent) | 


(4) 
= b(x) — €3(x). 


The first-order condition is 

b’ —c3 = 0, (5) 
and the second-order condition requires that 

De Ghee 


The solution to equation (5) yields point x” on figure 1. The following 
theorem shows that x? > x®. 

‘THEOREM, 1x o- xe) 

Proor: From assumptions 1 and 2, x” is the unique global 
maximum of E(x) = b(x) — ¢,(x) — ¢2(x) — ¢3(x). Consequently, b’(x”) 
= i(x") — 63(x") = ca(x") — 0. From assumption 2, ci(x-) + ex) 
0; therefore, P’(x®) = b'(x®) — c3(x®) > 0. But this violates the 
first-order condition for a maximum of P (x). Indeed, from the prop- 
erties of strictly concave functions, since P’(x”) > 0, then either (i) 
P'(x) = 0 for x = x? > x*, or (ii) P’(x) = 0 for no finite x. In the first 
case, x” is a global interior maximum which exceeds x”. In the second 
case, there is no interior maximum since P(x) increases without 
bound. A fortiori, a project scale larger than x” is preferred. Q.E.D. 


Model N: Non Est e Pluribus Unum, or Every District for Itself 


The next stage in our examination of political mechanisms is to 
partition the single national constituency into multiple, disjoint politi- 
cal units called “districts” with representation in a legislature. Each 
district, through its representative, is presumed to maximize its net 
(private) benefits without regard to the costs imposed on other dis- 
tricts. Publicly supported projects are funded through taxes which fall 
primarily on other districts. Hence, the benefits are concentrated 
while the costs are diffused. 

Any political choice institution consisting of representatives of mul- 
tiple, disjoint constituencies is characterized by a principle of 
aggregation—that is, a voting or decision rule—and by the substan- 
tive choices made under that decision rule. There are several litera- 
tures which address these issues. The theoretical thrust of these 


8’ These include research on logrolling, constitutional choice, and distributive 
policymaking. There is a large literature on each of these topics, but Buchanan and 
Tullock (1962) remains the best introduction and overview. 
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literatures is twofold: (1) Why is one institutional decision rule em- 
ployed rather than another? and (2) What are the policy consequences 
of a given decision rule? Regarding the first, with special emphasis on 
majority rule, theorists have focused on the instability of decisive 
coalitions, the ex ante uncertainty surrounding the composition of 
winning coalitions, and the cost of organizing and monitoring coali- 
tion partners. Their express purpose is to address how institutional 
actors seek, through suitably arranged institutional practices, to im- 
prove their circumstances by evolving coping strategies, formal rules, 
and other mechanisms (called norms in the sociological literature, 
rules or binding commitments in the game theory literature, and 
contracts in the property rights literature). The second question takes 
institutional practices as fixed and examines the policy decisions im- 
plied by those practices. The historical origins of and rationales for 
institutional rules are of little concern in these latter analyses. 

Our chief concern is the second question in which we assess the 
project choices of a legislature in the distributive policy area. In those 
policy areas characterized by a project-by-project orientation, the 
geographic concentration of benefits, and the diffusion of costs, there 
is abundant evidence that universalism and reciprocity are prevailing 
decision rules in the U.S. Congress. The former practice assures any 
interested district a project; the latter, in recognition of the fact that 
district differences translate into different policy priorities, facilitates 
a process of mutual support and logrolling. These two practices 
combine to permit packages of distinct projects earmarked for in- 
terested districts to obtain the support even of those without a stake in 
the package in exchange for reciprocal treatment. Empirical studies, 
moreover, repeatedly observe the operation of universalistic criteria. 
Examples include the pork barrel of rivers and harbors (Maass 1951; 
Ferejohn 1974); model cities and urban renewal (Plott 1968); tax 
loopholes (Manley 1970); the traditional tariff (Schattschneider 
1935); private member bills (Froman 1967); military procurement 
(Rundquist 1973); and categorical grants-in-aid (Mayhew 1974). In- 
deed, some scholars observe a tendency to infuse controversial 
policies with distributive elements in order to build a more inclusive 
coalition.’ 

Elsewhere (Fiorina 1978; Weingast 1979; Shepsle and Weingast 
1981) the conditions under which institutional actors prefer univer- 
salistic criteria to pure majority rule are derived. These results, to- 
gether with the preponderance of empirical evidence cited above, 
provide the basis for our focus on universalism and its policy conse- 
quences. First we explore the consequences of universalism in the 


9 See Stockman (1975) and Fiorina (1978) for illustrations. 
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extreme in which each legislator independently determines whether 
to propose a project and, if so, at what scale. The following section 
then examines the conditions under which there are incentives for 
legislators to coordinate project selection. Consider legislator 7's 
maximand regarding his district’s project:’° 


Nj(x) = b(x) + x(x) — &T(x) — ¢3(x). (6) 


The legislator is presumed to maximize the district’s private benefits, 
b(x) + c,(x), minus its share of the taxes, t;T(x) = t[ce,(x) + c2(x)], 
minus the externalities of the project which fall on the district, ¢3(x). 
This simply reflects the proposition that legislators are reelection 
oriented and that their prospects are positively associated with the net 
benefits they deliver to their constituents. According to this reelection 
mechanism, voter decisions correspond to a “what have you done for 
me lately?” evaluation." 
Maximization of (6) yields the first-order condition 


bY +e, -4t7T —c¢=0, 
or 
b' + cj =t,(c; + cg) + G3. (7) 
The second-order condition requires 
bY + Cl — tic, + 6s) — €3.—.0, 


Equation (7) has a solution yielding a maximum point x” for (6) 
depicted in figure 1. The next theorem establishes that, in a wide set 
of circumstances, x" exceeds x”. 

THEOREM 2: If cj > t;(cj + cg), then x > x?. 

PRrooF: Note that N(x) defined in (6) may be recast as 


N(x) = P(x) + ex(x) — &Lea(x) + co(x)], 
where P(x) = b(x) — c3(x). At the maximum of P(x), x = x? 
Pi(x) = _ 0 (Xb =e, 1) — 0. 


This implies that at x = x? 


> 


N (x2 \ = Piel ee cx") — 6 lex? ) (x) 
= x(x") — tlei(x*) + c2(x*)). 
Therefore, the premise of the theorem implies that N’(x) > 0 for all x 
' Eq. (6) does not contain district j’s total benefits and costs, only those falling within 
its control. Theorem 4 below establishes the innocuousness of this omission. 


"' This model of voting is known as retrospective voting in contrast to the prospective 


voting model, initially popularized by Downs (1957), in which voters respond to prom- 
ises for future policy. 
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<x?. Consequently, if N(x) possesses a maximum—call itx”—then x” 
> x?. Moreover, if N(x) does not possess a maximum, then our 
theorem obtains a fortiori since N(x) then increases without bound. 
OE-D. 

The condition in the premise of theorem 2, ¢; > t(¢i + ¢2), can be 
presumed to hold in most circumstances. This condition requires that, 
in district-initiated projects, locally earmarked expenditures grow 
more rapidly with project scale than local taxes. Since ¢; is ordinarily 
quite small, only a modicum of imagination by legislators is required 
to find projects with this local expenditure characteristic. For exam- 
ple, if taxes are shared evenly by districts, then ¢; = 1/n, where n is the 
number of districts, and the condition becomes 


1 
cy > —(cy + €3). 
n 


That is, so long as district-targeted expenditures grow with project 
scale at a rate at least 1/nth as fast as total expenditures, the premise 
(and hence the conclusion) of theorem 2 holds; in even moderate- 
sized legislatures this is a fairly weak condition. 

More generally, suppose t; = ;(z) where z might be any characteris- 
tic (examples of which are given below). Then the following com- 
parative statics result may be established. 

THEOREM 3: Let x” be the optimum for (6). Then dx%/dz > 0 if 


and only if tj; < 0. 
Proor: Rewriting (6), we have 


N(x) = b(x) + [1 — &(z)ler(~) — & (2) e2(x) — e3(x), 

and first-order condition 

N'(x) = b'(x) + (1 — &(2)ler(x) — t(z)e2(*) — ¢3(x) = 9. 
The second-order requirement is 

N"(x) = b"(x) + [1 — &(z)]ei(x) — t(z)e2(x) — ¢3(x) < 0. 
Totally differentiating the first-order condition yields 

{b"(x) + [1 — )(z)]et(x) — 4 (z)ez(x) — e3(x) pdx 
= [c(x) + 03(x)]t5(z)dz. 


Thus, 


dz {0%(x) + T1 — & (z)lei(x) — &)(z)ex(x) — ea(x)} 


The numerator of the coefficient on the right-hand side is positive 
(from assumption 2) and the denominator is negative (from the 


Kp [ef(x) + of(x)] t1(z). 
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second-order condition). The coefficient, therefore, is negative, es- 
tablishing that the sign of dx/dz is opposite that of t)(z): 


, d. 
i3(2)< 0>— > 0, 


d 
ti(z) > Cae 0. 


Q.E.D. 

Coro.iary (The Law of 1/n): If district tax share is a declining 
function of the number of districts (n), then the degree of inefficiency 
in project scale (x” — x*) is an increasing function of the number of 
districts. 

Proor: Let ¢; = t;(n) with ¢j(n) < 0. Then, from theorem 3, dx /dn > 
05 QOlED: 

The corollary indicates that when taxes are apportioned as a de- 
creasing function of the number of political units—for example, ¢; (7) 
= 1/n for allj—then the optimum project scale for any district grows 
as the polity is more finely partitioned into districts.’ Theorem 3, 
however, is more general, for it applies to tax mechanisms that may be 
the function of any politically relevant characteristic. If a district’s tax 
share is a decreasing function of certain of its legislator’s institutional 
characteristics (membership or influence on tax-writing committee), 
political characteristics of its representative (is he a committee chair- 
man? is he associated with the majority party?), or economic or demo- 
graphic characteristics of the district (proportion of families below 
poverty level, proportion of population above age 65), then we can 
associate increasingly inefficient projects with particular kinds of dis- 
tricts as defined by these tax-relevant characteristics. In all these cases, 
theorem 3 indicates that the equilibrium scale of a district’s project 
(and, given the assumptions, its degree of inefficiency) changes with 
respect to some tax criterion in precisely the opposite way the tax 
share changes with respect to that criterion. 


Institutional Incentives to Restrict Project Size 


We have just shown that decentralized choice by a representative 
legislature characterized by a universalism mechanism, a tax-sharing 


" This analysis presumes that the only change in N(x) following a change in the 
number of districts is the tax rate, t;. If, however, ¢1(x), C(x), and c3(x) depend on the 
configuration of districts, then two countervailing tendencies may be observed. In- 
creasing the number of districts (1) transforms some portion of ¢,(x) into co(x) and (2) 
decreases the tax share of the district. Since the first effect reduces and the second 
increases the optimal project scale, the net effect is ambiguous without further 
specification. 
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rule, and a political objective function yields a vector of projects x” = 
(x, ...,x¥). Theorems | and 2 provide the conditions in which x? > 
xf,i = 1,...,n. However, x?, the project scale for district 7 that 
maximizes its legislator’s political objective function (6), is computed 
in isolation of computations by other legislators; therefore, it does not 
take the expenditure and tax externalities (positive and negative) 
generated by those other projects into account. 

In this section, we seek to determine whether or not there is an 
institutional basis for restraining unbridled universalism.’ In par- 
ticular, we turn to an examination of packages of projects and seek to 
discover whether there exists a package of constitutionally restricted 
projects, x° = (xf,...,x,), with the property that x© >,x™ (where >; is 
the preference order of district 7 over packages of projects). To 
accomplish this, consider the complete political maximand of legis- 
lator j, B’(x;, . . . , Xn), which rewrites (6) to incorporate the effects of 
projects in other districts. The net benefits to districtj consist of b-type 
benefits from its own project, b(x;); ¢;-type expenditures from its own 
project, ¢,(x;); c2-type expenditures spent in district j from other 
projects, Liz; Coj(x;); nonexpenditure costs from its own project ¢3(x;); 
and its tax share of the total expenditures, ¢; Dia (es (x) + co(%;)|. Thus 


B(x, «5 x) = [Bl05) + e055) + Die lad| 
iFi 
n (8) 
- | ca(xs) +t) Cer(m) + ca(xi)]}. 
i=1 

Before considering the possibility of restrictions on project size, we 
first characterize choice under the complete political maximand, 
Bi(x,,...,%n). Calling x? legislator j’s solution to the maximization of 
(8), we have: 

THEOREM 4: x? = x}. 

ProoF: Equation (8) may be rewritten 


Bi = {b (x5) + €4(x;) — €3(xj) — tler(xj) + C(x; )]} 


(9) 
as (> C25 (xi) — bj yi [e,(x;) + cx(xi)]} 
AI iA 
which, from (6), becomes 
BY = N,(xj) + (> C25 (i) — bj y Ley(xi) + cals} (10) 
iFi ixj 


3 In Shepsle and Weingast (1981), we describe results that demonstrate the ex ante 
superiority of universalism to pure majority rule in the eyes of each legislator. Here we 
ask whether a restricted form of universalism, in turn, is superior to pure universalism. 
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Since the terms in braces are not functions of x;, the first- and 
second-order conditions for B’ are the same as those for N;(x;); thus, 
their respective maxima are the same. Q.E.D. 

Theorem 4 establishes that our initial focus on N; (instead of B’) 
involved no loss of generality for questions about decentralized proj- 
ect choice. Moreover, theorem 4 establishes that x} is a Nash strategy 
for legislator j and that the project vector, x”, is a Nash equilibrium." 

Turning to the role of cooperatively imposed restrictions on project 
size, we state a characterization theorem that establishes the circum- 
stances under which districts (or their representatives) prefer a re- 
stricted collection of projects, x©, to the collection under unrestrained 
universalism, x’. For the vector of projects x* = (xf,...,xn), let E;(x*) 
represent the tax and expenditure effects of other projects on district 
j. Specifically, 


Bs(xX) = >) {tsLea x!) + colt) — cos (xf). 
iAj 
In effect, E;(x*) is the tax bill for district 7 for all other projects net of 
local expenditures to district 7 from these other projects—that is, net 
negative external costs. 
THEOREM 5: x© >; x” if and only if Nj(x¥) — Nj(x°) < E;(x¥) 
a E; (xo): 

Theorem 5 conveys the following idea: Legislator; has an interest 
in substituting a collection of projects x© = (xf,...,xn) forx” = (x¥,..., 
xn) 1f and only if his district’s reduction in political benefits from the 
reduced scale of its own project is compensated for by a concomitant 
reduction in its burden of net negative external costs. There are 
several ways in which this may fail to hold for a given district j. For 
example, (i) district 7 has a sufficiently small tax share, ¢;; (ii) district 7 
is, disproportionately, a source of project inputs for other districts, in 
which case the ¢2;(x;) terms are large; and (iii) the political benefits for 
district j with project scale x}, b(x7) + c,(x¥), are inordinately large. In 
each of these cases, the condition may fail, implying the absence of a 
unanimous preference for x© over x”. Moreover, there do not appear 
to be any interesting properties associated with the family of functions 
b(x), c(x), and ¢(x) that satisfy the condition in theorem 5; nor is there 
any ex ante basis for supposing that the condition in theorem 5 will be 
encountered in empirical settings. 

Despite the lack of unanimous preference for restriction under all 


“ This view interprets our model of unrestricted universalism as a game. The choice 
set for player 7 is the size of his project, x;, when project choice is governed by a 
universalism mechanism. The Nash equilibrium follows from the separability of the 
positive and negative externalities of other projects in (10). As a result, each district 
makes its own decisions without attending to the externalities it produces or consumes. 
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circumstances, legislators have something to gain by properly ac- 
counting for the pecuniary externalities of project scale selection. The 
next theorem provides some insight into the optimal set of projects 
which internalizes these effects. 
THEOREM 6: x? maximizes &j-, B’(x), given in (8). 
PROOF: 
n 


LG >, B(x) = yy {> (es) + €1(x;) >) C25 (xj) — €3(%;) 


= ey [ex(2,). + ca(xi)] 


n 


S* [b (x5) + 15) + 2x5) — €3(x;) — €1(%5) — C25) 


ja 


3 


= > [b(5) ~ ealas)h 


Jt 


The first-order conditions are 


OL 
Ox; 


Noting that (6?L)/(0x,0x;) = 0 for 2 # j, the second-order conditions 
are 


I UX CAR Oy fly awe g fhe 





O7L 


Fe = PH) Be) <0, f= Lem 





These equations and inequalities are the same as (5), which identifies 
the vector x” in figure 1. Q.E.D. 

Theorem 6 shows that total political net benefits are maximized 
with the vector x? = (x?,...,x%). Potential gains may be captured if 
the system of universalistic project selection is amended so that, while 
each district is assured a project, the project scale is determined as if 
there were but a single district (as in model P above). Notice that 
theorem 6 does not assert thatx? >; x%,j = 1,...,n (substituting x” for 
x, theorem 5 shows the restricted circumstances in which this will 
hold). What theorem 6 does assert is that a compensation scheme 
which redistributes net benefits is feasible so that x” together with this 
compensation is preferred by all districts not only to x” but to any 
other omnibus of projects. 

It is occasionally asserted that the distributive politics game is a 
prisoner’s dilemma in the economic sense—that unrestrained univer- 
salism produces a project package that is an economically inefficient 
Nash equilibrium, on the one hand, and is unanimously regarded as 
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less preferable than x® (x” >;x",7 = 1,...,n) on the other. Theorems 
1, 2, and 4 establish the first part of this assertion. However, retaining 
the political conceptualization of net benefits given in (8), theorem 5 
shows that the second part of the assertion does not always follow. 
The collection x” is not always unanimously regarded as preferable to 
x”. More importantly, theorem 6 shows that even when all represen- 
tatives favor restrictions, the politically optimal set of projects is x”, not 
ra 

This discussion underscores the basic point of the paper. The 
efficient collection, x”, though normatively attractive in welfare analy- 
sis, is not always behaviorally relevant. The implication of the political 
maximand (8) and theorems 1, 2, 5, and 6 is that legislators hold no 
brief for efficiency, per se, either with regard to their own project 
selection or a package of such projects. 


Extensions and Discussion 


The model developed in the previous sections roots the inefficiency of 
distributive politics in democratic mechanisms and especially in the 
geographic basis of political constituencies. This latter feature pro- 
duces two independent sources of bias. First, locally targeted expen- 
ditures are counted by the local constituency as benefits. Second, the 
districting mechanism in conjunction with the taxation system pro- 
vides incentives to increase project size beyond the efficient point by 
attenuating the relationship between beneficiaries and revenue 
sources. A cooperative legislature has no incentive to remove entirely 
these sources of inefficiency (beyond that described in the discussion 
following theorem 6). 

In this section, we examine several related themes and applications. 
Each of these is either an extension of our model to domains beyond 
that of traditional distributive policy or a specialization of our model 
to substantively relevant cases of distributive policy. 


Congressional Limitation on Project Size 


A well-known behavioral mechanism has operated in Congress since 
the 1880s that restrains the attempts of legislators to fund their pet 
projects. At the authorization stage, a universalism mechanism is at 
work—the annual omnibus public works bill, for example, typically 
contains authorizations for projects in most congressional districts. 
Following the passage of authorizing legislation comes the separate 
stage of actually appropriating monies. Here, the Committee on Ap- 
propriations systematically scales down each project (for a description 
of this process, see Fenno [1966] and Ferejohn [1974]) 
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Our model sheds some light on this well-established congressional 
practice. First, suppose the premise of theorem 5 holds, where x” = 
x. Hence, x? >;x*,7 = 1,...,n. Second, assume legislator 7 and only 
legislator j knows x} and x}. Theorem 5 implies that legislators 
unanimously favor a mechanism reducing project sizes from the un- 
coordinated choice, x¥. But there does not appear to be a straightfor- 
ward “demand-revealing” mechanism inducing legislators to an- 
nounce anything other than their maximizing Nash strategy, x’. 
Thus, in the face of imperfect information, the simpler mechanism of 
scaling down all projects—the current practice of the appropriations 
committee—may yield a vector of project sizes less than x” and pre- 
ferred by all legislators.” 


Examination of Interesting Subclasses 


We mention briefly, as an indication of the utility of this approach, 
some interesting special cases of the political maximand (6): 


N (xj) = B(x) + €1(x3) — €3(x5) — tler(xj) + €2(x;)]. 


The details may be provided by the reader. 

1. Benefit tax.—This tax scheme requires the district to pay the 
entire cost of a strictly local public project (4; = 1). The maximand 
becomes 


N;(x;) = 5(x;) — C2(xj) — €3(x;), 


and assumptions 1 and 2 imply a project scale x% with the property xe 
<x¥ <xP <x. Note that if co(x;) = 0, thenx# = x? whereas, if ¢,(x;) = 
0, then x* = x”. This last fact suggests that a benefit tax in conjunction 
with no local expenditures is a sufficient condition for public sector 
efficiency. 

2. Free-ride tax.—In this case, t; = 0 and the maximand is 


N (xj) = b(x;) + €1(x;) — €3(x5), 


which implies a scale x% > x¥. This case approximates local public 
goods, financed by user fees, where the local residents rarely number 
among the users. For example, if Yellowstone National Park were 
financed by user fees assessed states in proportion to their respective 


15 A complete analysis must resolve two issues. First, as a consequence of the scaling 
down practice by the Appropriations Committee, will the legislator strategically seek a 
project scale in excess of x% so that, when it is scaled down, it will eventually reach size 
x? Alternatively, are there sanctions discouraging artificial inflation of project scale at 
the authorizing stage? Second, are there sanctions available to be applied against the 
Appropriations Committee to ensure it does not scale down too much? See Fenno 
(1966) for some empirical details. These issues concerning demand revelation mecha- 
nisms are theoretically intriguing but take us too far afield to be dealt with here. 
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share of the population using the park, then Wyoming residents 
receive a cheap, if not free, ride on taxes. 

3. Foreign aid versus military assistance. —We can interpret the politi- 
cal popularity of the latter and unpopularity of the former via the 
following calculation. Let x” and x™ describe levels of the two forms of 
aid; assume b(x/) = b(x”), ceteris paribus, for any political constit- 
uency (assumed, in any event, to be small unless the constituency 
contains partisans of or emigrants from the benefiting country); for 
some constituencies (producers of military hardware), c,(x/) > 0; for 
all constituencies, on the other hand, c,(x‘ ) = 0; for all constituencies, 
C3(x¥) = c3(x4i) = 0. Thus, 


N(x) = b(xif) + (1 — t)er(%f) — tea(x¥), 
N(x} ) = b(x5) — t¢2(x5). 


Assumptions | and 2 imply («)* > (x#)*, an observation consistent 
with their respective popularities. It also makes sense of the recent 
policy innovation of attaching strings to foreign aid requiring recip- 
ients to make purchases in the United States (c,-type expenditures). 

4. Rube Goldberg machines and military bases. —Suppose b(x;) = 0 for 
a project in a given constituency, 7. Then 


N (x5) = CG )ei() Gea (x) = a Ge). 


Even though 6(x;) = 0, the (politically) optimal project scale may be 
greater than zero. Specifically, 


x’ > 0 if N’(0) > 0, or (1 — ¢;)c/(0) — t,¢4(0) — c4(0) > O. 


Consider the case of many military bases (and other Rube Goldberg 
machines) which, by the Department of Defense’s own admission, 
provide virtually no contribution to defense (b[x] = 0). These never- 
theless remain attractive to local constituencies (hence x¥ > 0) because 
of the overriding importance of these projects to the local economy in 
the form of c;-type benefits. Throughout this paper, we have em- 
phasized the political inappropriateness of economic net benefits as a 
relevant decision criterion. In this special case, it appears that even the 
absence of economic benefits altogether is not a disqualifying charac- 
teristic in political choice. 


Generalization to Non—Pork Barrel Policies*® 


One of the central features of our models is the unpacking of costs in 
politically relevant ways in which we distinguish project costs returned 


'® These themes are developed in more detail in Shepsle and Weingast (1980). 
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to the district as expenditures, costs returned to other districts, and 
nonexpenditure costs borne within the district. We focused, however, 
on distributive or pork barrel projects, defined as projects whose 
benefits are geographically concentrated and whose costs are spread 
through general taxation. Two extensions offer further insight into 
nondistributive policies. The first distinguishes an additional cost, 
c4(x), or external, nonexpenditure costs borne by other districts. The 
second defines another source of economic benefits, b2(x), that ac- 
crues to other constituencies as a consequence of a project in a par- 
ticular district. 

1. Additional external costs.—If additional external costs of the jth 
project spill over into other districts, it can easily be shown that the 
degree of inefficiency increases when governed by decentralized 
project choice under the political maximand, N(x). Since the model 
now divides externalities into their politically relevant components, 
c3(x) and c4(x), we can make further observations about the degree to 
which the political system can be relied upon to internalize exter- 
nalities associated with public activities. If the externalities are not too 
large (in the sense that they are local and do not extend into 
neighboring districts), the public sector action governed by a repre- 
sentative legislature internalizes them. However, if they are large, 
public sector action may not. 

The politically relevant (though economically arbitrary) distinction 
implied by district boundaries suggests that the public sector can only 
provide certain categories of public goods which are not available 
through private market arrangements. Hence, jurisdictional ques- 
tions become of paramount importance when producing a local pub- 
lic good like flood walls along a river which divides two political 
jurisdictions. Some of the most infamous cases of pork barrel politics 
illustrate this point: The flood walls along the lower part of the 
Mississippi River, which divides Louisiana from Mississippi, are 3 feet 
higher on the Mississippi side (see Ferejohn 1974, pp. 56-58). Simi- 
larly, the levees on the Indiana side of the Wabash River are higher 
than those on the Illinois side. Thus, a universalistic representative 
legislature is biased toward projects with low c3-type costs while failing 
to consider c4-type costs. 

9. Additional external benefits. —Let b(x;) = b,(x;) + b2(xj;) where b;(x;) 
are the benefits of the jth project concentrated in district 7 and 62(x;) 
are the benefits consumed by residents of other districts (presumed 
zero throughout the body of the paper). That is, b2(x;) is the positive 
consumption externalities (as compared with c2[x;], which are the 
positive production externalities). 

Because the benefits outside the district are not readily internalized 


662 JOURNAL OF POLITICAL ECONOMY 


under a distributive policy mechanism, large multidistrict (multistate) 
projects are likely to be rare relative to projects with concentrated 
local benefits. Consequently, multidistrict public goods (in the eco- 
nomic sense) are not only likely to be underproduced by a market 
mechanism but by a representative legislature as well (see Aranson 
and Ordeshook 1978). A universalistic representative legislature is 
biased toward projects with high ;-type benefits, while failing to 
internalize b,-type benefits. Thus, both positive and negative exter- 
nalities adversely affect public as well as private provision. 


Conclusion 


Throughout this paper we have focused on the sources of inefficiency 
in public decision making. Our model demonstrates that democratic 
institutions play an important role. Three mechanisms were shown to 
influence the politically optimal project choice: the political cost- 
accounting mechanism, the districting mechanism, and the taxation 
mechanism. These features of the political economy systematically 
transform the economic benefits and costs into political counterparts. 
Since it is the latter that determine the maximands for political actors 
and not their economic counterparts, these govern political choice. 
While our modeling of the districting and taxation mechanisms is 
straightforward and uncontroversial, there are circumstances in 
which our treatment of the incidence of gains and losses from local 
expenditures is implausible. We have presumed that pecuniary gain- 
ers figure more prominently than pecuniary losers in a legislator’s 
reelection constituency. However, under some circumstances this may 
not be true. A legislator’s reelection constituency, for example, may be 
dominated not by factor owners of a public project but by those who 
would bear the brunt of the pecuniary losses and the nonpecuniary 
external costs of the project. We would not, therefore, expect the 
legislator to seek such projects. Indeed, since the menu of distributive 
programs is sufficiently diverse, there normally is something available 
for everyone. Thus we tend to find reclamation projects in the West, 
locks and dams in river districts with an active construction industry, 
and wildlife refuges in Sierra Club districts. Because of this diversity 
in policy preferences and program categories, the logic supporting 
the political distortion of pecuniary incidences continues to hold. 
Our principal conclusion is that since political institutions funda- 
mentally alter the perceptions and incidence of benefits and costs, 
they systematically bias project choices away from the efficient out- 
comes. In the context of distributive politics, this was shown to imply 
larger projects and programs than are economically warranted. 
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Flexible Exchange Rates, Prices, and the Role of 
“News”: Lessons from the 1970s 
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This paper analyzes the key issues and lessons from the experience 
with flexible exchange rates during the 1970s. It analyzes the effi- 
ciency of the foreign-exchange market and the volatility of exchange 
rates, as well as the relationships between exchange rates and interest 
rates. A key distinction is made between anticipated and unantici- 
pated events, and it is shown that the key factor affecting exchange 
rates has been “news.” The analysis then proceeds to analyze the 
relationship between exchange rates and prices. The deviations from 
purchasing power parities are being interpreted in terms of the 
modern asset-market approach to the exchange rate. 


I. Introduction 


Recent experience with flexible exchange-rate systems has led to 
renewed interest in the operation of foreign-exchange markets as 


An earlier version of this paper was prepared for a conference (Stabilization Policy: 
Lessons from the 1970’s and Implications for the 1980's) sponsored by the Federal 
Reserve Bank of St. Louis and the Center for the Study of American Business at 
Washington University held at the Federal Reserve Bank of St. Louis on October 
19-20, 1979. I am indebted to Lauren J. Feinstone for helpful suggestions and efficient 
research assistance and to the National Science Foundation (grant SOC 78-14480) for 
financial support. I have benefited from useful comments by Andrew Abel; William 
Branson; Sebastian Edwards; Stanley Fischer; Craig S. Hakkio; Edi Karni; Paul Krug- 
man; Leonardo Leiderman; Robert E. Lucas, Jr.; Allan Meltzer; Michael L. Mussa; Sam 
Peltzman; Nasser Saidi; and Roland Vaubel, as well as of participants in seminars held 
at the NBER, Columbia University, New York University, Harvard University, the 
University of Virginia, the University of Rochester, Oxford University, Tel-Aviv Uni- 
versity, and the International Monetary Fund. This research is part of the NBER’s 
program in international studies. The views expressed are those of the author and not 
necessarily those of the NBER. 
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reflected in many recent studies of the principal determinants of 
exchange rates. The 1970s witnessed the dramatic alteration of the 
international monetary system from a regime of pegged exchange 
rates which prevailed for about a quarter of a century (since the 
Bretton Woods Conference) into a regime of flexible (though man- 
aged) rates. As a consequence of the emergence of the new legal and 
economic system, traders, national governments, and international 
organizations were confronted with new economic problems, choices, 
and instruments. During the 1970s exchange rates fluctuated widely 
and inflation rates accelerated. The international monetary system 
had to accommodate extraordinarily large oil-related shocks which 
affected trade flows in goods and assets. Huge oil payments had to be 
recycled. Uncertainties concerning future developments in interna- 
tional politics reached new heights, and the prospects for the world 
economy got gloomier. These developments placed unprecedented 
pressures on the markets for foreign exchange as well as on other 
asset markets. They were associated with a large slide in the value of 
the U.S. dollar and resulted in speeding up the creation of new 
institutions like the European Monetary System which provided the 
formal framework for the management of exchange rates among 
members. The increased interdependence among countries and the 
realization that exchange-rate policies by one national government 
exert influence on other economies have also induced legal responses 
from international organizations. For example, in late April 1977, the 
executive board of the International Monetary Fund approved the 
details of the second amendment to article IV of the amended Articles 
of Agreement dealing with the principles and procedures for surveil- 
lance of exchange-rate policies of member countries. 

These developments provide the background for this paper which 
is intended to sum up the relevant evidence bearing ona set of related 
questions and to present a brief survey of key issues and lessons from 
the experience with floating rates during the 1970s. The main orien- 
tation of the paper is empirical and the analysis is based on the 
experience of three exchange rates involving the dollar/pound, the 
dollar/French franc and the dollar/deutsche mark (DM). Section II 
provides an analysis of the efficiency of foreign-exchange markets by 
examining the relationship between spot and forward exchange rates. 
The extent of exchange-rate volatility is also examined. This analysis 
of the foreign-exchange markets sheds light on several questions 
including (i) whether exchange rates fluctuate “excessively”; (ii) 
whether speculation in the foreign-exchange markets is destabilizing; 
(ii) whether there is “insufficient” speculation in the foreign- 
exchange markets; and (iv) whether there is evidence for market 
failure in the sense that there are unexploited profit Opportunities. 
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These issues are significant in assessing the performance of floating 
rates as well as in evaluating the need for government intervention in 
the foreign-exchange markets. The analytical framework that is used 
for interpreting the volatility of exchange rates and the association 
between spot and forward rates is the modern theory of exchange- 
rate determination. Within this framework, exchange rates are 
viewed as the prices of assets that are traded in organized markets 
and, like the prices of other assets, are strongly influenced by expec- 
tations about future events. 

The relationship between exchange rates and interest rates is 
analyzed in Section III. One of the key issues that is raised in this 
section is the distinction between anticipated and unanticipated 
changes in rates of interest. As an analytical matter this distinction is 
important, because the modern approach to exchange-rate determi- 
nation implies that exchange rates are strongly influenced by “news,” 
which by definition is unpredictable. Therefore, it is unanticipated 
rather than anticipated changes in interest rates that should be closely 
associated with changes in exchange rates. This prediction is tested 
empirically. 

Section IV analyzes the relationship between exchange rates and 
prices by examining the patterns of deviations from purchasing 
power parities. The main point that is being emphasized is that there 
is an important intrinsic difference between exchange rates and na- 
tional price levels. Exchange rates are more sensitive to expectations 
concerning future €évents than national price levels. As a result, in 
periods which are dominated by “news” which alters expectations, 
exchange rates are likely to be more volatile, and departures from 
purchasing power parities are likely to be the rule rather than the 
exception. The analysis of the relationship between exchange rates 
and prices is relevant for assessing whether the flexible exchange-rate 
system was successful in providing national economies with an added 
degree of insulation from foreign shocks and whether it provided 
policy makers with an added instrument for the conduct of mac- 
roeconomic policy. The evidence regarding deviations from pur- 
chasing power parities is also relevant for determining whether there 
is a case for managed float. Section V contains concluding remarks. 


II. The Efficiency of the Foreign-Exchange Market and the 
Movement of Exchange Rates 


In this section I analyze the principal characteristics of the relation- 
ship between spot and forward exchange rates which seem to emerge 
from the experience of the 1970s. Following an analysis of the effi- 
ciency of the foreign-exchange market, I discuss the more general 
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issues underlying the relationships between spot and forward rates 
and their volatility. 


The Efficiency of the F oreign-Exchange Market 


One of the central insights of the monetary (or the asset-market) 
approach to the exchange rate is the notion that the exchange rate, 
being a relative price of two assets, is determined in a manner similar 
to the determination of other asset prices and that expectations con- 
cerning the future course of events play a central role in affecting 
current exchange rates.* 

If the foreign-exchange market is efficient and if the exchange rate 
is determined in a fashion similar to the determination of other asset 
prices, we should expect current prices to reflect all currently avail- 
able information. Expectations concerning future exchange rates 
should be incorporated and reflected in forward exchange rates. ‘To 
examine the efficiency of the market, I first regress the logarithm of 
the current spot exchange rate, In S;, on the logarithm of the 1-month 
forward exchange rate prevailing at the previous month, In F;_;, as in 
equation (1):? 


In S7 =-d.-- 0 lin fre tye (1) 


If the market for foreign exchange is efficient, so that prices reflect 
all relevant available information, then the residuals in equation (1), 
u;, Should contain no information and therefore should be serially 
uncorrelated. Further, if the forward exchange rate is an unbiased 
forecast of the future spot exchange rate (as should be the case under 
an assumption of risk neutrality), then the constant term in equation 
(1) should not differ significantly from zero,’ and the slope coefficient 
should not differ significantly from unity. I examine three exchange 
rates: the dollar/pound, the dollar/franc, and the dollar/DM. Equa- 


‘ For collections of articles summarizing this approach, see the Scandinavian Journal of 
Economics, no. 2 (1976) and Frenkel and Johnson (1978). 

* For an application of the same methodology in analyzing the efficiency properties 
of the foreign-exchange market during the German hyperinflation of 1921-23 see 
Frenkel (1976, 1977, 1979). For an application to other exchange rates during the 
1920s see Frenkel and Clements (1981); for an application to the 1920s and the 1970s 
see Krugman (1977); for an interesting analysis using time-series and cross-sectional 
data see Bilson (in press); for an analysis of market efficiency using novel econometric 
techniques see Hakkio (1979) and Hansen and Hodrick (1980), and for surveys see 
Levich (1978, 1979). 

* More precisely, if (assuming risk neutrality) the forward rate measures the expected 
value of the future spot rate, then the constant term in the logarithmic equation (1) 
should be —.507; see Frenkel 1979. The statement that under risk neutrality the 
forward rate equals the expected future spot rate neglects the effects of the stochastic 
elements in prices. As an empirical matter this neglect does not seem to be consequen- 
tial; see Frenkel and Razin 1980. 
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tion (1) was estimated using monthly data for the period June 1973— 
July 1979 (data sources are listed in the Appendix). The beginning of 
the period was set so as to concentrate on the experience of the 
current exchange-rate regime (following the initial post—Bretton 
Woods transition period). The resulting ordinary-least-squares (OLS) 
estimates are reported in table 1. Also reported in table 1 are addi- 
tional regressions which will be analyzed shortly. 

As may be seen for the dollar/DM exchange rate, the hypotheses 
that (at the 95 percent confidence level) the constant term does not 
differ significantly from zero and that the slope coefficient does not 
differ significantly from unity cannot be rejected. These hypotheses 
are rejected for the dollar/franc exchange rate and are rejected (mar- 
ginally) for the dollar/pound exchange rate. The joint hypotheses, 
however, that the constant is zero and that the slope coefficient is 
unity cannot be rejected at the 95 percent level for the dollar/pound 
and the dollar/DM exchange rates and at the 99 percent level for the 
dollar/franc exchange rate. The test statistics for testing the joint 
hypotheses are reported in the column headed by F in table 1. These 
results are relevant for assessing whether the forward rate is an 
unbiased forecast of the future spot rate. We turn next to the question 
of efficiency. 

It was argued above that in an efficient market, expectations con- 
cerning future exchange rates are reflected in forward rates and that 
spot exchange rates reflect all currently available information. If 
forward exchange rates prevailing at period ¢ — 1 summarize all 
relevant information available at that period, they should also contain 
the information that is summarized in data corresponding to period 
t — 2. It thus follows that including additional lagged values of the 
forward rates in equation (1) should not greatly affect the coefficients 
of determination and should not yield coefficients that differ 
significantly from zero. The results reported in table 1 are consistent 
with this hypothesis; in all cases the coefficients of In F;_2 do not differ 
significantly from zero, and the inclusion of the additional lagged 
variables does not improve the fit. Most important, in all cases the 
Durbin-Watson statistics are consistent with the hypothesis of the 
absence of first-order autocorrelated residuals, and an examination of 
higher-order correlations (up to 12 lags) shows that no correlation of 
any order is significant.” 

To examine further the relationship between the various exchange 
rates, we note that one of the assumptions underlying equation (1) 


4 Since In F;_, is highly correlated with In S,_,; the Durbin-Watson statistic may not be 
appropriate since equation (1) is very similar to a regression of In S; on its own lagged 
value, Durbin’s h-statistic reveals, however, that the residuals are serially uncorrelated 
at conventional confidence levels. 
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was that the forward exchange rate measures the unobservable value 
of the expected future spot exchange rate. This assumption provided 
the justification for using equation (1) instead of the explicit specifica- 
tion of the rational expectations hypothesis that is embodied in 
equation (2): 


In S,= Eey In Sea Er, (2) 


where E,_; In S,; denotes the expected (logarithm of the) spot ex- 
change rate for period ¢ based on the information available at period 
t — 1. If, however, the forward exchange rate att — 1 is a “noisy” proxy 
for the expected future value of the spot rate (1.e., it measures it witha 
random error), then we would obtain 


foo ine, — £ ind, +0; £ w,)— 0, (3) 
and substituting equation (3) into equation (2) yields: 
In Si =at b In FS + (€&; we V{-1)- (4) 


In this case the error term in equation (1) would be u; = €& — vy-1; the 
assumption that the covariance between In F;_,; and u; is zero would 
entail a specification error, and the application of the OLS procedure 
would yield biased estimates due to the classical errors in variables 
bias. 

In order to examine the possibility that the OLS estimates might be 
subject to the errors in variables bias, one needs to test the hypothesis 
that cov (u;, In F;_) = 0. This test follows the specification test outlined 
by Hausman (1978).° To perform the test, equation (1) was estimated 
by applying the OLS procedure as well as by using an instrumental 
variables (IVAR) estimation method. Under the null hypothesis of no 
misspecification the OLS coefficients vector by is an efficient and 
unbiased estimate of the true coefficient vector. Under the alternative 
hypothesis of misspecification, the vector b, is biased and an unbiased 
coefficient vector 6, can be obtained by applying an instrumental 
variables estimation procedure. The test statistic relevant for testing 
the null hypothesis can be written as 


m = (6, — by)'(var 6, — var Deer ies bo), (5) 


where var (b,) and var (by) denote the variance-covariance matrices of 
b, and by, respectively. Under the null hypothesis, m is distributed (in 
large samples) as x” with 2 degrees of freedom. Table 1 reports the 
results of estimating equation (1) by applying the instrumental vari- 
ables estimation method. As may be seen for all exchange rates the 


5 This test was recently applied by Obstfeld (1978) to the analysis of the foreign- 
exchange market during the 1970s and by Frenkel (1980a, 19806) to the analysis of the 
foreign-exchange markets during the 1920s. 
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two vectors of coefficients 6, and by are very close to each other. For 
example, for the dollar/pound exchange rate, the constants are .033 
and .030, and the slopes are .956 and .961—consequently, the re- 
sulting m-statistic is 2.01, which is well below 5.99—the critical value of 
x?(2) at the 95 percent confidence level. The m-statistics correspond- 
ing to the other exchange rates are also below this critical value. It is 
concluded, therefore, that the use of the forward exchange rate as a 
proxy for expectations does not introduce a significant errors in 
variables bias and, thus, the use of the OLS estimation procedure 
seems appropriate. ® 

The principal conclusion that may be drawn from the previous 
discussion is that the behavior of the foreign-exchange market 
during the 1970s has been broadly consistent with the general impli- 
cations of the efficient market hypothesis. 


Exchange-Rate Movement: Volatility and Predictability 


In this section I analyze the volatility of exchange rates and the extent 
to which this volatility is predictable. As is well known, during the 
same period exchange rates have been very volatile. The standard 
errors of the monthly percentage changes of the three exchange rates 
have been about 3 percent per month. Further, the standard errors of 
the regressions in table 1 indicate that the forecasts of future spot 
exchange rates based on the forward rates are very imprecise: The 
standard errors of the equations are about 3 percent per month. 


* The efficiency of the foreign-exchange market can also be analyzed from a differ- 
ent angle as in Frenkel (19806). Consider the equation 


n 
Xt = Ay + ayt + S Bete eet a ie 
i=1 


where x; denotes the percentage change of the spot exchange rate (In S; — In S,_1); 7-4 
denotes the forward premium on foreign exchange (In F;_, — In S,_,); t denotes time; n 
denotes the number of lags; and w denotes an error term. If 7;_,; summarizes all 
available information concerning the future evolution of the exchange rate, then given 
the value of the forward premium 7;_,, the past history of the percentage change of the 
exchange rate should not “help” the prediction (i.e., the past history should not be 
viewed as Granger-causing future changes), and the joint hypotheses that a, a,, and f; 
are zero and that y is unity should not be rejected. The results of applying these tests to 
the three exchange rates for various numbers of lags as well as to the pooled data base 
of the three exchange rates show that the null hypothesis cannot be rejected at the 95 
percent confidence level, since the values of the various F-statistics fall well below the 
corresponding critical values. It is noteworthy, however, that the power of this test is 
low and that the joint hypothesis that ao, a, B;, and y are zero could also not be rejected. 
The difficulties in “explaining” the percentage change of the exchange rate in terms of 
past values of various variables reflect the fact that like the prices of other assets which 
are traded in organized markets, changes in exchange rates are dominated by “news” 
which by definition could not have been incorporated in past changes or in the lagged 
forward premium. For a further elaboration see the following section. 
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These characteristics of price changes (volatility and unpredictabil- 
ity) are typical of auction and of organized asset markets. In such 
markets current prices reflect expectations concerning the future 
course of events, and new information which induces changes in 
expectations is immediately reflected in corresponding changes in 
prices, thus precluding unexploited profit opportunities from arbi- 
trage. The strong dependence of current prices on expectations about 
the future is unique to the determination of durable asset prices; it is 
not a characteristic of price determination of nondurable com- 
modities. The strong dependence of asset prices on expectations also 
implies that periods which are dominated by uncertainties, new in- 
formation, rumors, announcements, and “news,” which induce fre- 
quent changes in expectations, are likely to be periods in which 
changes in expectations are the prime cause of fluctuations in asset 
prices. Further, since the information which alters expectations must 
be new, the resulting fluctuations in price cannot be predicted by 
lagged forward exchange rates which are based on past information.’ 
Therefore, during such periods, one should expect exchange rates 
(and other asset prices) to exhibit large fluctuations. When the prime 
cause of fluctuations is new information, one may expect that lagged 
forward exchange rates (which are based on past information) are 
imprecise (even though possibly the best unbiased) forecasts of future 
spot rates. 

To gain further insights into the implications of this perspective on 
the relationship between predicted and realized changes in exchange 
rates, figures 1-3 present plots of predicted and realized percentage 
changes in exchange rates for the three pairs of currencies where the 
predicted change is measured by the lagged forward premium. Also 
presented in these figures are the differentials in national inflation 
rates which are discussed in Section IV. The key fact which emerges 
from these figures is that predicted changes in exchange rates account 
for a very small fraction of actual changes.* This phenomenon is also 
reflected in the comparison between the variances of actual and pre- 
dicted changes in exchange rates: In all cases the variances of monthly 
percentage changes in exchange rates exceed the variances of 
monthly forward premia by a factor that is larger than 20.° This fact 
suggests that the bulk of exchange-rate changes seem to be due to 


7 The analysis of the role of “news” in determining current exchange rates and in 
explaining forecast errors from the forward rate has been made forcefully by Mussa 
(1976a, 19766, 1977, 1979a) and Dornbusch (1978). The large degree of volatility is 
also analyzed by McKinnon (1976), who attributes it to insufficient speculation. 

8 These and the following empirical regularities are analyzed in detail in Mussa 
(1979a). See also Frenkel and Mussa 1980. 

9 For an analysis of the relationship between the variances of series of predictions and 
series of realizations see Shiller (1979) and Singleton (1980). 
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Fic. 1.—Monthly percentage changes of the U.S./U.K. consumer price indices [A(In 
COL ys/COLpx)]; of the $/£ exchange rate (A In S,); and the monthly forward premium 
(In(,-,/S;-1)], July 1973-July 1979. 


“new information” which, by definition, could not have been antici- 
pated and reflected in the forward premium or discount which pre- 
vailed in the previous period. 

This view of the foreign-exchange market can be exposited in terms 
of the following simple model.!° Let the logarithm of the spot ex- 
change rate on day t be determined by 


In S; = % + bE(In Sy, — In Sp, (6) 


where E In Si, — In S,) denotes the expected percentage change in 
the exchange rate between ¢t and t + 1, based on the information 
available at ¢, and where z, represents the ordinary factors of supply 
and demand that affect the exchange rate on day ¢. These factors may 
include domestic and foreign money supplies, incomes, levels of out- 
put, etc. Equation (6) represents a sufficiently general relationship 
which may be viewed as a “reduced form” that can be derived froma 
variety of models of exchange-rate determination. These models may 


* The following paragraph draws on Frenkel and Mussa (1980). 
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Fic. 2.—Monthly percentage changes of the U.S./French consumer price indices 
[A(In COLys/COL,)]; of the $/Fr exchange rate (A In S,); and the monthly forward 
premium [In(F,_,/S,-,)], July 1973—July 1979. 


differ in their emphasis of the determinants of z(t), but they all are 
likely to share a similar reduced form.'! Assuming that expectations 
are rational in that equation (6) applies to expectations of future 
exchange rates, it follows, by forward iteration, that 


k 


aaa b 
E, In S143 = nimi ye (spp) Bete (7) 


Thus, the current exchange rate (j = 0) and current expectations of 
future exchange rates (j > 0) are linked, because both depend on 
expectations concerning the future z’s. The strength of the link de- 
pends on the magnitude of b which characterizes the dependence of 
the current exchange rate on the expected percentage change 
thereof.!2 The presumption is that, due to profit opportunities from 


1! See, e.g., the comprehensive econometric model of Fair (1979). 

122A result of this general form is derived in Mussa (1976a). The unique role of 
expectations is also emphasized by Black (1973), Dornbusch (1976c, 1978), Kouri 
(1976), and Bilson (1978). In general, the value of b may be viewed as the relevant 


676 JOURNAL OF POLITICAL ECONOMY 


US/Germany 


PERCENT CHANGE 





1973 1974 1975 1976 1977 1978 1979 


Fic. 3.—Monthly percentage changes of the U.S./German consumer price indices 
[A(In COLys/COL,)]; of the $/DM exchange rate (A In S,); and the monthly forward 
premium [In(F;_,/S,_,)], July 1973-July 1979. 


arbitrage, this link is strong at least for the exchange rates expected in 
the near future. Hence, the current exchange rate, In S; = E; In S,, 
should be closely linked to the current expectation of the next pe- 
riod’s exchange rate, E; In S;,,, which in turn should be closely linked 
to the exchange rate expected for the following period, E,1n S;4, and 
so on. 

In order to examine this hypothesis I present in figures 4-6 plots of 
the spot and the contemporaneous forward exchange rates for the 
three pairs of currencies. Also presented are the ratios of national 
price levels which are discussed in Section IV. If the dominant factor 
underlying changes in rates is new information which alters views 
about current and expected future exchange rates by approximately 
the same amount, then one should expect a high correlation between 
movements of spot and forward rates. This fact is clearly demon- 


parameter for determining whether a specific commodity (whose pricing rule is de- 
scribed in terms of eqq. like [6]-[7]) may be viewed as an asset. The higher the value of b 
is for a given commodity, the larger is its asset attribute. 
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EXCHANGE RATES AND PRICE RATIOS 
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Fic. 4.—Monthly observations of the $/£ spot (In S,) and forward (In F,) exchange 
rates and the ratio of the U.S./U-K. cost of living indices [In(COLys/COLpx)] (scaled to 
equal the spot exchange rate at the initial month): June 1973—July 1979. 


strated by figures 4-6 where it is seen that spot and forward exchange 
rates tend to move together and by approximately the same amount 
(the vertical difference between the two rates corresponds to the 
percentage forward premium or discount on foreign exchange). The 
correlations between the spot and the forward exchange rates for 
the three pairs of currencies exceed .99, and the correlations between 
the corresponding percentage changes of the spot and forward rates 
exceed .96. The high correlation between movements in spot and 
forward rates is expected, since the two rates respond at the same time 
to the same flow of new information (which is presumed to affect the 
rate for more than one period). In general, the details of the relation- 
ship between spot and forward exchange rates depend on the time- 
series properties of the z’s in equation (7) and in particular on whether 
the new information is viewed as permanent or transitory.” 


13 New information might be “permanent” when the relevant horizon is 1 month, 
while it might be transitory when the relevant horizon is a year. In that case, the 
correlation between the spot exchange rate and the contemporaneous 1-month for- 
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EXCHANGE RATES AND PRICE RATIOS 


US/France 
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Fic. 5.—Monthly observations of the $/Fr spot (In S,) and forward (In F;) 
exchange rates and the ratio of the U.S./French cost-of-living indices [In(COLys/COL,)] 
(scaled to equal the spot exchange rate at the initial month): June 1973—July 1979. 


The comovement of spot and forward rates is evidence of the close 
link between current and expected future exchange rates which is 
illustrated by equation (7). This characteristic is typical of the 
foreign-exchange market and is also shared by prices of many assets 
and commodities traded in organized markets. The recent pattern of 
gold prices provides a useful example of this general principle. Table 
2 reports the spot and the futures prices of gold as recorded recently 
in the International Money Market (IMM) at the Chicago Mercantile 
Exchange on 5 consecutive days. The two key facts which are illus- 
trated by this table are (i) the extent of day-to-day volatility in gold 
prices and (ii) the general uniformity by which these changes are 
reflected in the price of gold for immediate delivery as well as in the 
prices for the eight future delivery dates. 


ward rate is likely to be high, while the correlation between the spot rate and the 
12-month forward rate is likely to be low. The perceived permanence of the new 
information can be inferred from the correlation between the spot and the various 


maturities of the forward rate. As expected, it is generally found that this correlation 
diminishes with the maturity of the forward contract. 
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EXCHANGE RATES AND PRICE RATIOS 
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Fic. 6.—Monthly observations of the $/DM spot (In S;,) and forward (In F;) 
exchange rates and the ratio of the U.S./German cost-of-living indices [In(COLys/ 
COL,)] (scaled to equal the spot exchange rate at the initial month): June 1973—July 
1979. 


Another feature which is revealed by figures 4—6 is that the con- 
temporaneous spot and forward exchange rates are approximately 
equal, thus indicating that the market’s best forecast of the future spot 
rate is (approximately) the current spot rate. This phenomenon re- 
flects the fact that, as an empirical matter, exchange rates have fol- 
lowed (approximately) a random-walk process. For such a process, 
current prices are, indeed, the best forecasts of future prices (and to 
the extent that the exchange rate had some drift, the statement above 
should be interpreted in reference to that drift). It is relevant to note, 
however, that while the random-walk phenomenon seems to corre- 
spond to the actual paths of exchange rates, it does not reflect a 
theoretical necessity. 

The final characteristic of the foreign-exchange market is described 
by figures 7—9, which plot for the three pairs of currencies the spot 
exchange rate and the forward premium on forward exchange. Since 
the spot rate and the forward premium are expressed in terms of 
different units (where the latter is expressed as percentages per 
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EXCHANGE RATE AND FORWARD PREMIUM 








US $/UK £ 












1973 1974 1975 1976 1977 1978 1979 


Fic. 7.—Monthly observations of the normalized $/£ spot exchange rate (In S/’) 
and the normalized forward premium [In(F/S)”]. Both series are normalized by sub- 
tracting from each series its mean and by dividing by the corresponding standard error: 
June 1973-July 1979. 


month), the two series were normalized for the purpose of the plots 
by subtracting from each series its mean and by dividing by the 
corresponding standard error.'* The fact which emerges from these 
figures is that generally (though not always) there is a positive corre- 
lation between the expected depreciation of the currency (as mea- 
sured by the forward premium on foreign exchange) and the spot 
exchange rate. This positive correlation may be rationalized by noting 
that currencies which are expected to depreciate are traded at a 
discount in the forward market and, on average, these currencies also 
command a lower foreign-exchange value in the spot market." This 


14The normalized values of the spot rate, In S?, and of the forward premium, 
In(F/S,)%, which are plotted in figs. 7-9, are defined as In SY = (In S,; — In S)/oin s, and 
In(F,/S)* = (In(F,/S, — In(F/S)V/ ounces, where a bar over a variable indicates its 
sample mean, and where o denotes the sample standard deviation of the subscripted 
variable. 

15 It is noteworthy that since the forward premium (like the rate of interest) and the 
exchange rate are dimensionally incommensurate, their association raises questions 
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Fic. 8.—Monthly observations of the normalized $/Fr spot exchange rate (In S$?) 
and the normalized forward premium [In(F;/S,)"]. Both series are normalized by sub- 


tracting from each series its mean and by dividing by the corresponding standard error: 
June 1973-July 1979. 


relationship is embodied in the specification of equation (6) and is 
interpreted further in the next section. 


III. Exchange Rates, Interest Rates, and Innovations 


This section contains an analysis of the relationship between exchange 
rates and interest rates from the perspective of the monetary (or the 
asset-market) approach to the exchange rate. Following a discussion 


of the broad facts, the analysis proceeds with an empirical examina- 
tion of the role of “news.” 


Exchange Rates and Interest Rates: The Broad Facts 


To set the stage for this section, it is useful to recall the analysis which 
predicts a negative association between the rate of interest and the 


that are familiar from the discussions of the Gibson paradox. In a separate paper I 
intend to examine the relationship between exchange rates and the forward premium 
(or the interest differential) in light of the various explanations of the Gibson paradox. 
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Fic. 9.—Monthly observations of the normalized $/DM spot (In S/’) and the 
normalized forward premium [In(F,/S,)*]. Both series are normalized by subtracting 
from each series its mean and by dividing by the corresponding standard error: June 
1973—July 1979. 


exchange rate. According to that analysis, a higher rate of interest 
attracts foreign capital which induces a surplus in the capital account 
of the balance of payments and thereby induces an appreciation of 
the domestic currency (i.e., a lower spot exchange rate). Another 
variant of this approach states that the higher rate of interest lowers 
spending and thus induces a surplus in the current account of the 
balance of payments which results in a lower spot exchange rate. A 
third variant claims that the higher rate of interest implies (via the 
interest parity theory) a higher forward premium on foreign ex- 
change; and to the extent that at a given point in time the forward 
exchange rate which represents the expected future spot rate is pre- 
determined by past history—as would be the case under the adaptive 
expectations hypothesis (which is clearly rejected by the evidence on 
the comovements of spot and forward rates)—the required rise in the 
forward premium will be brought about by a lower spot rate (1.e., by 
an appreciation of the domestic currency). Whatever the route, by 
ignoring the distinction between nominal and real rates of interest, 
this approach predicts a negative relationship between the rate of 
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interest and the spot exchange rate (or, alternatively, a positive re- 
lationship between the rate of interest and the foreign-exchange value 
of the domestic currency). 

While such a prediction might be appropriate for noninflationary 
environments, it might be entirely inappropriate for inflationary en- 
vironments (like the one prevailing in the United States in recent 
years). In such periods variations in rates of interest are most likely to 
be dominated by variations in inflationary expectations rather than by 
liquidity effects associated with changes in the ratio of money to 
bonds. In such an environment, the rate of interest is expected to be 
positively correlated with the exchange rate. The broad facts are con- 
sistent with this hypothesis. Over the recent period, the rise in the rate 
of interest in the United States (relative to the foreign rate of interest) 
has been associated with a rise in the spot exchange rate (i.e., with a 
depreciation of the dollar) rather than with a fall in the spot rate. 
Figure 10 illustrates the point by plotting the foreign-exchange value 
of the U.S. dollar against the interest rate differential. As is evident, 
the higher (relative) rate of interest in the United States has been 
associated with a higher exchange rate (i.e., with a lower foreign- 
exchange value of the dollar). 

The positive association between the rate of interest and the ex- 
change rate in the context of the U.S. dollar and the inflationary 
environment can be accounted for by the monetary (or the asset- 
market) approach to the exchange rate which puts a special emphasis 
on the influence of the expectations on the current values of exchange 
rates.'® For example, according to the monetary approach, a rise in 
the domestic (relative) rate of interest which is primarily dominated 
by a rise in the expected (relative) rate of inflation induces a decline in 
the demand for real cash balances; for a given path of the nominal 
money supply, asset-market equilibrium requires a price level which is 
higher than the price which would have prevailed otherwise. When 
the domestic price level is linked to the foreign price through some 
form of purchasing power parity, and when the path of the foreign 
price is assumed to be given, the higher domestic price can only be 
achieved through a rise in the spot exchange rate (i.e., through a 
depreciation of the currency).!7 


‘For theoretical developments and applications of the approach see, e.g., 
Dornbusch (1976a, 1976b), Frenkel (1976), Kouri (1976), Mussa (1976a), Bilson (1978), 
Frenkel and Johnson (1978), Hodrick (1978), Frankel (1979), Clements and Frenkel 
(1980), and Frenkel and Clements (1981). 

‘It should be emphasized that this explanation of the positive association between 
the rate of interest and the exchange rate does not rely on a rigid form of the 
purchasing power parity theory. It only requires that domestic and foreign price levels, 
when expressed in terms of the same currency, are positively correlated. The evidence 
from the 1970s is consistent with this requirement; see Frenkel 1981. 
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Fic. 10.—Foreign-exchange value of the U.S. dollar and interest rate differentials. 
Source: Mudd 1979. 


This explanation of the positive association between interest rates 
and exchange rates has an intuitive appeal in that it implies that, in an 
inflationary environment, a relatively rapid rise in prices is associated 
with high nominal rates of interest as well as with a depreciation of the 
currency in terms of foreign exchange. This relationship is embodied 
in equation (6), which states that an expected depreciation of the 
currency (which in our case is associated with inflationary expecta- 
tions and high nominal rates of interest) results in an immediate 


depreciation.** 
The foregoing analysis also provides the explanation for the obser- 


ative relationship between interest rates and the 
tionalized under the assumption that it concen- 
trates on the short-run liquidity effects of monetary changes. It should be emphasized, 
however, that in an inflationary environment, like the one prevailing in the United 
States, the applicability of this rationalization is very limited. The short-run liquidity 
effect is emphasized in Dornbusch (1976). The role of inflationary expectations in 
dominating exchange-rate developments is emphasized in Frenkel (1976). Edwards 
(1979) and Frankel (1979) attempt to integrate these two factors. 


18 The traditional prediction of a neg 
exchange rate could, in principle, be ra 
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vation (which was noted in Sec. II) that, generally, there is a positive 
correlation between the forward premium on foreign exchange and 
the level of the spot rate. Since during inflationary periods the spot 
rate 1s expected to be positively correlated with the interest rate 
differential, and since according to the interest parity theory that 
differential must equal the forward premium on foreign exchange, it 
follows that the forward premium is also expected to be positively 
correlated with the level of the spot rate.'® 


Exchange Rates and “News” 


One of the central implications of the rational expectations hypothesis 
is that unanticipated events, “news,” play a predominant role in af- 
fecting real variables and asset yields. This implication is embodied in 
many expositions of modern macroeconomics, and its empirical con- 
tent has been the subject of numerous recent studies.”° In the context 
of exchange-rate determination the discussion in Section II and in 
particular the contributions by Mussa (1977, 1979a) and Dornbusch 
(1978) emphasized that the predominant cause of exchange-rate 
movements is “news” which could not have been anticipated.”! Ex- 
pressing the spot exchange rate at period ¢ as the sum of factors which 
were anticipated from the past, as well as factors which represent 
“news,” Dornbusch (1978) decomposes the effects of “news” into 
those which alter the expected future spot rate between the last 
period and the present and those which lead to a reassessment of the 
l-period interest rate differential starting at the present, that is, 
“news” about the term structure. Both of these white-noise, serially 
uncorrelated components play a role in determining the spot ex- 
change rate in Dornbusch’s analysis. 

The evidence presented in Section II suggests that the forward rate 
summarizes the information that is available to the market when the 
forward rate is being set, and in equation (3) it was assumed that the 
expected exchange rate can be written as a + b In Fi, plus a serially 
uncorrelated error. We may, therefore, express the spot rate at pe- 
riod ¢ as a function of factors which have been known in advance and 


are summarized by the lagged forward rate, as well as a function of 
the “news”: 


InS,;=a + bInF,_, + “news” + wu. (8) 
In what follows this notion is applied to an empirical analysis of the 


'9 For evidence on the robustness of the interest parity relationship see Frenkel and 
Levich (1977). 

* See, e.g., Barro 1977 and Fischer 1980. 

“1 See also Bilson 1978; Frenkel and Mussa 1980; Isard 1980; and Longworth 1980. 
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role of “news” as a determinant of the exchange rate. The key 
difficulty lies in identifying the variable which measures the “news.” 
Since, quite frequently, it is difficult to observe and quantify the 
“news,” it is convenient to examine the relationship between the 
exchange rate and a variable whose time series is likely to manifest 
the “news” promptly. Assuming that asset markets clear fast and that 
the “news” is immediately reflected in (unexpected) changes in the 
rates of interest, equation (8) may be written as 


In Si =a rt b In ie = al (a Pad a); as Ej, oT a*) ] a Wt, (9) 


Pete | Sonne 


exchange rate 


where the bracketed term denotes the innovation in the (1-month) 
interest differential and where E;_,(2 — 7*), denotes the interest differ- 
ential which was expected to prevail in period ¢ based on the informa- 
tion available at t — 1. The expected interest rate differential was 
computed from a regression of the interest differential on a constant, 
on two lagged values of the differential, and on the lagged forward 
exchange rate In F;-4. 

As was argued above, the association between exchange rates and 
interest rates is likely to be positive during periods in which most of 
the variations in nominal rates of interest are dominated by variations 
in inflationary expectations—a characteristic which seems to fit the 
inflationary environment of the 1970s. Under such circumstances, 
when the unexpected interest differential reflects “news” concerning 
inflationary expectations, the coefficient @ in equation (9) is likely to 
be positive.” Table 3 reports the two-stage least-squares estimates of 
equation (9) for the three exchange rates over the period June 1973— 
July 1979.23 As may be seen in all cases, the coefficients of the unex- 
pected interest differential are positive and, in the case of the dollar/ 
pound exchange rate, the coefficient is statistically significant. In 


2 In general, of course, the sign of the coefficient a depends on the source of the 
variation in the interest rate. 

23 In all cases, the lagged forward exchange rate was included as an instrument in 
order to obtain consistent estimates; see Nelson 1975. Adding lagged values of the 
percentage changes of the domestic and the foreign money supplies as determinants of 
the expected interest differential and adding the current values of the percentage 
change of the money supplies as instruments for the unexpected interest differential 
did not affect the results in any material way. In order to obtain consistent estimates, 
the assumption made in the regressions reported in tables 3-6 is that, in forming 
expectations concerning the interest differential, individuals look only at the lagged 
forward premium and at past values of the differential. An alternative way to compute 
the expected differential would use data on the term structure of interest rates. Since 
data on the differential of 2-month rates are not readily available, this computation 
would require interpolations. 
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order to verify the importance of using the series of innovations in the 
interest differential, table 3 also reports estimates of regressions 
which replace the innovations by the actual series of the interest 
differential. In all cases the coefficients of the actual interest differen- 
tial do not differ significantly from zero. Table 4 describes analogous 
two-stage least-squares estimates of regressions which include both 
the actual interest rate differential and the innovation in the differ- 
ential.”* Again, in all cases the coefficients on the actual differential do 
not differ significantly from zero, while the coefficients on the inno- 
vations are all positive and are significant for the dollar/pound and 
the dollar/franc exchange rates. 

One possible interpretation of the positive coefficient on the (unex- 
pected) interest differential may be given in terms of the prediction of 
the monetary approach to the exchange rate, in which case the esti- 
mate of the coefficient a in equation (9) might be interpreted as an 
estimate of a structural parameter. Under an alternative interpreta- 
tion the innovations in the interest differential belong in equation (9) 
only as far as they manifest “news” which is relevant for exchange- 
rate determination. If, for example, the dominant element of “news” 
was variation in inflationary expectations, then one could also use the 
innovations in other time series as long as they reflect the relevant 
“news.” To examine this possibility the same regressions as in tables 
3—4 were estimated using the 12-month interest differential, and the 
results are reported in tables 5-6. As before, in all cases the 
coefficients on the actual differential do not differ significantly from 
zero, while in all cases the coefficients on the innovations in the 
differential are positive and significant for the dollar/pound and the 
dollar/franc exchange rates.” 

On the whole the record shows that during the 1970s exchange 


* The difficulties in obtaining instruments for data that are innovations are obvious 
since, by virtue of being “news,” it is unlikely that variables which characterize the 
history can serve as good instruments. The difficulties are acute in cases where a 
variable and its expected value appear in the same regression, in which case consistent 
estimates require the use of an instrument that is contemporaneous with the innovation 
and is exogenous (see McCallum 1979). Thus, in addition to a constant and the lagged 
forward exchange rate, Durbin’s rank variable was used as an instrument for estimating 
the innovations in the interest rate differential. 

* To allow for the possibility that the exchange-rate equation includes both the short- 
and the long-term interest rates where, as suggested by Frankel (1979), the former 
captures liquidity effects and the latter captures expectations effects, eq. (9) was also 
estimated using the innovations in both the 1-month and the 12-month interest differ- 
ential on the right-hand side. Since the two sets of innovations are highly collinear, 
none of the coefficients differed significantly from zero. In this context it is also 
noteworthy that the 12-month interest rate contains elements of the 1-month rates, due 
to the characteristics of the term structure of interest rates. As a result, coefficient 
estimates from regressions which use both rates, like those in Frankel (1979), must be 
interpreted with great care. 
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rates and the interest rate differential have been associated positively, 
thus indicating that during that inflationary period the same factors 
which induced a rise in the interest differential also induced a rise in 
the spot exchange rates. Furthermore, consistent with the hypothesis 
that current changes in exchange rates are primarily a response to 
new information, the evidence shows the importance of the innova- 
tions in the interest differential. 

The principle that current exchange rates already reflect expecta- 
tions concerning the future course of events implies that unantici- 
pated changes in exchange rates are primarily due to innovations. 
Since the empirical work suggests that most of the actual changes in 
exchange rates are unanticipated, it follows that most of the actual 
changes in exchange rates are due to “news.” In the present section 
this principle was applied to the analysis of the relationship between 
exchange rates and interest rate differentials. The principle, however, 
is general. For example, it implies that the relationship between a 
deficit in the balance of trade and the exchange rate depends crucially 
on whether the deficit was expected or not. A deficit that was expected 
may have no effect on the exchange rate, since the latter already 
reflected these expectations. In contrast, an unexpected deficit in the 
balance of trade may contain significant new information that is likely 
to be accompanied by large changes in the exchange rate.”° This 
distinction might be useful in interpreting the weak and unstable 
relationship between the balance of trade and the exchange rate 
without having to rely on explanations like the J curve or on variable 
import and export elasticities. 


IV. Exchange Rates and Prices 


One of the striking facts concerning the relationship between prices 
and exchange rates during the 1970s has been the poor performance 
of the predictions of the simple versions of the purchasing power 
parity doctrine. The originators and proponents of the purchasing 


26 For a further elaboration on the relationship between exchange rates and the 
current account see Kouri (1976); Branson (1977); Branson, Halttunen, and Masson 
(1977); Dornbusch and Fischer (1980); and Rodriguez (1980). For a special emphasis on 
the role of innovations in the trade balance see Mussa (1979c), and for empirical 
evidence see Dornbusch (1980) and Hakkio (1980). It should be noted that the empiri- 
cal work on the association between exchange rates and current account innovations 
faces some difficulties since, in contrast with data on financial variables like interest 
rates, data on the current account are not available at short intervals. Furthermore, 
findings on the association between exchange rates and current account innovations 
should be interpreted with care since, rather than reflecting the sensitivity of exchange 
rates to “news,” they might just reflect invoicing practices according to which U.S. 
exports are invoiced in terms of U.S. dollars, while imports are invoiced in terms of 


foreign currencies. 
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power parity doctrine (Wheatley and Ricardo during the first part of 
the nineteenth century and Cassel during the 1920s) have viewed the 
doctrine as an extension of the quantity theory of money to the open 
economy. By now the consensus seems to be that, when applied to 
aggregate national price levels, purchasing power parities can be 
expected to hold in the long run if most of the shocks to the system 
are of a monetary origin which do not require changes in relative 
prices. To the extent that most of the shocks reflect “real” changes 
(like differential growth rates among sectors), the required changes in 
sectoral relative prices may result in a relatively loose connection 
between exchange rates and aggregate price levels. The experience 
during the 1970s illustrates the extent to which real shocks (oil em- 
bargo, supply shocks, commodity booms and shortages, shifts in the 
demand for money, differential productivity growth) result in sys- 
tematic deviations from purchasing power parities. As illustrated in 
figures 1-3, short-run changes in exchange rates have not been 
closely linked to short-run differentials in the corresponding national 
inflation rates as measured by consumer price indices. Furthermore, 
this loose link seems to be cumulative. As illustrated in figures 4—6 
divergences from purchasing power parities, measured in terms of 
the relationship between exchange rates and the ratio of consumer 
price indices, seem to persist. 

The link between prices and exchange rates is illustrated in table 7, 
which reports the results of regressions of the various exchange rates 
on the corresponding ratios of wholesale and of cost-of-living price 
indices. As may be seen, the results of the regressions which involve 
the U.S. dollar are extremely poor. For the dollar/pound and the 
dollar/franc exchange rates, the estimates of the coefficients of the price 
ratios are insignificant and, for the dollar/DM exchange rate, the es- 
timates differ significantly from unity. In contrast the results of the 
regressions of exchange rates that do not involve the U.S dollar or the 
U.S. price level (the pound/DM and the franc/DM exchange rates) are 
superior: Except for the wholesale price indices in the franc/DM 
regression all the coefficients are highly significant and the elasticities 
of the exchange rates with respect to the various price indices do not 
differ significantly from unity. 

The vast difference in the performance of the regressions for the 
various currencies can be explained by noting that first, due to trans- 
port cost, purchasing power parities are expected to hold better 
among the neighboring European countries than among each of 
these countries and the United States; second, changes in commercial 
policies and nontariff barriers to trade seem to have been more stable 
within Europe than between Europe and the United States; third, 
within Europe the snake agreement and later on the European Mon- 
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TABLE 7 


PURCHASING POWER ParRITIES: INSTRUMENTAL VARIABLES MONTHLY DaTa: JUNE 
1973—Juty 1979 
(Standard Errors in Parentheses) 











Dependent 

Variable In S, Constant In(@Pe Px In(PAP) SE D-W p 
m2 165 ro 027. 163 ~—-«.963 
149) (507) 

Doll ( 

ollar/pound ise « 1.070 029 166 .998 
(2.978) (.897) 
—1.521 184 ne 029 226 863 
(027) (.374) 

Dollar/franc ee nye ~1.070 029 230 901 
(.047) (.817) 
—.900 1.786 ae 034. «1.69 ~—«.739 
(.018) (230) 

Dollar/DM | 30 ~ 2.917 03 TOG UNE TSO 
(.175) (263) 
Ey GES 821 i 027 160 895 
(041) (.144) 

Pound/DM | {85 me 965 027 1.57  .909 
(048) (197) 
863 —.026 ev 020 161 981 
(143) (.487) 

Franc/DM 602 si 1.180 019 148 ~.929 
(048) (327) 





Nore.—The logarithm of the spot exchange rate is denoted by In S,; In(P,,/P%*) and In(P,/P*) denote, respec- 
tively, the logarithms of the ratios of the wholesale price indices and the cost of living indices. Cochrane-Orcutt 
iterative technique with a two-stage least-squares estimation method was used; the instruments are a constant, time, 
time squared, and lagged values of the dependent and independent variables. SE is the standard error of the 
equation, and p denotes the first-order autocorrelation coefficient. 


etary System have resulted in a reduced degree of intra-European 
flexibility of exchange rates; and fourth, there seem to have been 
large changes in the equilibrium real exchange rate between the U.S. 
dollar and the European currencies.”’ It should be noted, however, 
that to some extent the overall poor performance of the purchasing 
power parities doctrine is specific to the 1970s. During the floating 
rates period of the 1920s, the doctrine seems to have been much more 
reliable.”8 

The preceding discussion accounted for the persisting deviations 
from purchasing power parities in terms of changes in real factors 
which affect equilibrium relative price structure. It should be noted, 
however, that even in the absence of such changes there is a pre- 
sumption that, at least in the short run, exchange-rate fluctuations 


27 The failure of the regression of the franc/DM exchange rate on the ratio of the 
wholesale price indices is explained in terms of the large changes in the French 
intersectoral relative prices; for an elaboration see Frenkel (1981). 

28 For evidence see Frenkel (1976, 1978, 1980a) and Krugman (1978). 
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would not be matched by corresponding fluctuations of aggregate 
price levels. The discussion in Section II emphasized that in periods 
which are dominated by “news” which alters expectations, exchange 
rates (and other asset prices which are traded in organized markets) 
are expected to be highly volatile. Aggregate price indices, on the 
other hand, are not expected to reveal such a degree of volatility, 
since they reflect the prices of goods and services which are less 
durable and, therefore, are likely to be less sensitive to the “news” 
which alters expectations concerning the future course of events. It 
follows, therefore, that in periods during which there is ample “news” 
which causes large fluctuations in exchange rates, there will also be 
large deviations from purchasing power parities.”9 

The difference between the characteristics of exchange rates and 
national price levels is also reflected in their time-series properties and 
is fundamental for interpreting the deviations from purchasing 
power parities. The monthly changes in exchange rates exhibit little 
or no serial correlation, while national price levels do exhibit a degree 
of serial correlation. The “stickiness” exhibited by national price levels 
need not reflect any market imperfection but rather it may reflect the 
costs of price adjustment which result in the existence of nominal 
contracts of finite length. Likewise, it may reflect the results of confu- 
sion between relative and absolute prices and confusion between 
permanent and transitory changes. This difference between the 
time-series properties of exchange rates and prices is reflected in the 
low correlation between the practically random month-to-month 
exchange-rate changes and the serially correlated differences be- 
tween national rates of inflation. 

The different degrees of volatility of prices and exchange rates are 
illustrated in table 8, which reports the average absolute monthly 
percentage changes in the various exchange rates and prices. As is 
evident, the mean absolute change in the various spot exchange rates 
has been about 2 percent per month (and even slightly higher for the 
changes in the forward rate). The magnitudes of these changes have 
been more than double the magnitudes of the changes in most of the 
various price indices, as well as in the ratios of national price levels. 
For example, the mean monthly change in the cost-of-living price 
index was 0.4 percent in Germany, 0.7 percent in the United States, 


9 On this see Mussa (19792). It is noteworthy that the emphasis in the text has been 
on the words “large fluctuations”; this should be contrasted with periods during which 
there are large secular changes in the exchange rate (like the changes which occurred 
during the German hyperinflation). During such periods the secular changes do not 
stem necessarily from “news” and need not be associated with deviations from pur- 
chasing power parities. 
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0.9 percent in France, and 1.2 percent in the United Kingdom. These 
differences are even more striking for the detrended series. 

The notion that exchange rates have been volatile is clearly illus- 
trated by figures 1-3 and by table 8. The comparison of the mag- 
nitudes of the changes in the exchange rates with the magnitudes of 
the changes in the price indices and in the ratios of national price 
levels may suggest, according to a narrow interpretation of the pur- 
chasing power parity doctrine, that exchange-rate fluctuations have 
been “excessive.” The previous discussion, however, has emphasized 
that exchange rates, being the relative prices of assets, are funda- 
mentally different from the price indices of goods and services and, 
therefore, are expected to exhibit a different degree of volatility in 
particular during periods that are dominated by “news.” An alterna- 
tive yardstick for measuring the degree of exchange-rate fluctuations 
would be a comparison with prices of other assets. Indeed, while 
exchange-rate changes have been large relative to changes in national 
price levels, they have been considerably smaller than changes in the 
prices of other assets like gold, silver, many other commodities that 
are traded in organized markets, and common stocks. For example, 
table 8 also reports the mean absolute monthly percentage change in 
stock-market indices. As may be seen, the mean monthly change in 
these indices ranged from over 3 percent in Germany to over 6 
percent in the United Kingdom. By these standards it is difficult to 
argue that exchange rates have been excessively volatile. 

Given the short-run deviations from purchasing power parities, it 1s 
relevant to explore whether these deviations tend to diminish with 
time or tend to persist or even grow in size. In order to examine the 
patterns of the deviations, the autocorrelation functions and the par- 


TABLE 8 


MEAN ABSOLUTE PERCENTAGE CHANGES IN PRICES AND EXCHANGE RATES 
MonTHLy Data: JUNE 1973—JuLy 1979 


eee 
VARIABLE 
ese ee ee ee 


Exchange Rates 
against the Dollar 








Stock NT 
COUNTRY WPI CONG Market Spot Forward COL/COLys 
Us: .009 .007 .037 aoe Oe oh 
Ke .014 .012 .066 021 rO2II .007 
France 011 .009 .054 .020 .021 .003 
Germany .004 .004 .030 .024 .024 .004 





Note.—All variables represent the absolute values of monthly percentage changes in the data. WPI denotes the 
wholesale price index and COL denotes the cost-of-living index. Data on prices and exchange rates are from the IMF 
tape (May 1979 version). The stock market indices are from Capital International Perspective (monthly issues). 
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tial autocorrelation functions of these deviations for the wholesale 
and the cost-of-living price indices have been computed. The devia- 
tion from purchasing power parities during month t is denoted by A, 
and is defined as: 


A; — In Sy a In(P/P*),. (10) 


Figures 11-13 illustrate the patterns of the deviations for the three 
exchange rates. As may be seen, the general pattern is very similar for 
the three exchange rates and for the two price indices. In all cases the 
autocorrelation function tails off at what seems to be an exponential 
rate, and in all cases the partial autocorrelation function shows a spike 
at the first lag. This pattern seems to indicate (as might have been 
expected on the basis of the time-series properties of exchange rates 
and price indices) that the deviations from purchasing power parities 
follow a first-order autoregressive process. It is noteworthy, however, 
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Fic. 11.—A, The dollar/pound: deviations from PPP with wholesale price indices. B, 
The dollar/pound: deviations from PPP with cost of living indices. 
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Fic. 12.—d, The dollar/franc: deviations from PPP with wholesale price indices. B 
The dollar/franc: deviations from PPP with cost-of-living indices. 
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Fic. 13.—A, The dollar/DM: deviations from PPP with wholesale price indices. B, 
The dollar/DM: deviations from PPP with cost-of-living indices. 


that in all cases the value of the autoregression term is about .9, 
indicating the possibility that the series may not satisfy the stationarity 
requirement. To allow for this possibility, the autocorrelation func- 
tions and the partial autocorrelation functions of MA. that 1s;,0f 
the first difference of the deviations from purchasing power parities, 
have also been examined. The results indicate that these differences 
are serially uncorrelated and thus imply that the deviations A; follow a 
random-walk process. In view of this possibility it is concluded that 
the deviations from purchasing power parities seem to follow a first- 
order, autoregressive process, but that the data do not provide 
sufficient evidence to reject the alternative hypothesis of a random 
walk.*° 


V. Concluding Remarks 


This paper examined some aspects of the operation of flexible ex- 
change rates. The analysis was based on the experience of the 1970s. 
The principal conclusions which may be drawn from the empirical 
work are: 


30 If the deviations follow a random-walk process, then they do not entail (ex ante) 
unexploited profit opportunities. For a study of the deviations from purchasing power 
parities see Roll (1979). For an analysis of equilibrium deviations from purchasing 
power parities see Saidi (1977). It may be noted that the main difference between 
accepting the AR(1) rather than the random-walk hypothesis relates to the economic 
interpretation of the two alternative processes. The random-walk process implies that 
deviations from purchasing power parities do not tend to diminish with the passage of 
time, while the stable AR(1) process implies that there are mechanisms which operate to 
ensure that, in the long run, purchasing power parities are satisfied. For the purpose of 
forecasting the near future, however, there is very little difference between using the 
AR(1) process with an autoregressive coefficient of 0.9 and using the random-walk 


process. 
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i) In spite of the extraordinary turbulence in the markets for 
foreign exchange, it seems that, to a large extent, the mar- 
kets have operated efficiently. It should be emphasized, 
however, that the concept of “efficiency” used in this context 
is somewhat narrow in that it only refers to the notion that 
the markets do not seem to entail unexploited profit oppor- 
tunities. A broader perspective should deal with the social 
cost of volatility in terms of the interference with the effi- 
ciency of the price system in guiding resource allocation as 
well as with the cost of alternative outlets for the disturbances 
that are currently reflected in the volatility of exchange rates. 
As for the choice among alternative outlets for the distur- 
bances, one may argue that since the foreign-exchange mar- 
ket is a market in which risk can be bought and sold relatively 
easily, it may be reasonable to concentrate the disturbances 
in this market rather than transfer them to other markets, 
such as labor markets, where they cannot be dealt with in as 
efficient a manner. 


i) The high volatility of exchange rates (spot and forward) 
reflects an intrinsic characteristic of the relative price of mon- 
ies and other assets that are traded in organized exchange. 
The price of gold, the price of stocks, as well as exchange rates 
between national monies depend critically on expectations 
concerning the future course of events and adjust rapidly in 
response to new information. In this perspective, the ex- 
change rate (in contrast with the relative price of national 
outputs) is being viewed as a financial variable which is de- 
termined in a macroeconomic setting. 


iii) During inflationary periods, variations in nominal rates 
of interest are dominated by changes in inflationary expec- 
tations; as a result, high nominal rates of interest are asso- 
ciated with high exchange rates (a depreciated currency). 
This relationship was supported by the empirical work. 


iv) The asset view of exchange-rate determination implies 
that “news” is among the major factors which influence 
changes in exchange rates. In this context the key finding 
was the dependence of exchange-rate changes on the unex- 
pected changes in the rates of interest. This finding is in 
accord with the analytical prediction that current exchange 
rates already reflect current expectations about the future, 
while changes in the current exchange rates reflect primarily 
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changes in these expectations which, by definition, arise 
from new information. 


v) The experience of the 1970s does not support the pre- 
dictions of the simple version of the purchasing power parity 
doctrine which relates the values of current measured prices 
to current exchange rates. The empirical work showed that 
deviations from purchasing power parities can be charac- 
terized by a first-order autoregressive process. 


vi) One of the key analytical insights that is provided by the 
monetary (or the asset-market) approach to the exchange 
rate is that exchange rates do not reflect only current circum- 
stances but also reflect those circumstances which are ex- 
pected to prevail in the future. This anticipatory feature of 
the exchange rate (which is emphasized by Mussa [19795]) 
does not characterize (at least to such a degree) the prices of 
national outputs, which reflect to a large extent present and 
past circumstances as they are embedded in existing con- 
tracts. Consequently, periods which are dominated by large 
and frequent changes in expectations are likely to be periods 
in which the future is expected to differ greatly from the 
present and the past. Under such circumstances one may 
expect to find frequent deviations from purchasing power 
parities when the latter are computed using current prices. 
These deviations reflect the intrinsic difference between 
asset prices and national price indices.** 


vii) Since commodity prices do not adjust fully in response 
to exogenous shocks, it seems that intervention in the 
foreign-exchange market, which ensures that exchange rates 
conform with purchasing power parities, would be a mis- 
taken course of policy. When commodity prices are slow to 
adjust to current and expected economic conditions, it may 
be desirable to allow excessive adjustment in some other 
prices. Further, changes in real economic conditions requir- 
ing adjustment in the equilibrium relative prices of different 
national outputs occur continuously. An intervention rule 
which links changes in exchange rates rigidly to changes in 


31 Tt is interesting to note that this phenomenon was recognized by Gustav Cassel— 
the most recognized proponent of the purchasing power parity doctrine—according to 
whom: “The international valuation of the currency will, then generally show a ten- 
dency to anticipate events, so to speak, and become more an expression of the internal 
value that the currency is expected to possess in a few months, or perhaps in a year’s 
time” (Cassel 1930, pp. 149-50). 
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domestic and foreign prices in accord with purchasing power 
parity ignores the occasional need for equilibrating changes 
in relative prices. 


Appendix 
1. Exchange Rates 


The spot exchange rates are end-of-month rates obtained from the IMF tape 
(May 1979 version, updated to July 1979 using the November 1979 issue of 
the International Financial Statistics) obtained from the International Monetary 
Fund. 

The forward exchange rates are end-of-month rates for 1-month maturity. 
The forward rates for the U.K. pound and the DM for the period June 
1973-June 1978 are bid prices obtained from the International Money Mar- 
ket (IMM). For the period July 1978—July 1979 they are sell prices obtained 
from the Wall Street Journal. The forward rates for the French franc for the 
period June 1973-July 1974 are bid prices calculated from the Weekly Review 
publication of the Harris Bank which reports the spot rate and the forward 
premium; in each case the closest Friday to the end of the month was chosen. 
For the period August 1974—June 1978 the rates are bid rates obtained from 
the IMM, and for the period July 1978—July 1979 they are sell prices obtained 
from the Wall Street Journal. 


2. Prices 


The wholesale and cost-of-living price indices are period averages obtained 
from the IMF tape, lines 63 and 64, respectively. 


3. Rates of Interest 


All interest rates are 1-month Eurocurrency rates obtained from the Weekly 
Review of the Harris Bank. In all cases the figures used correspond to the last 
Friday of each month. 


4. Stock Markets 


The stock market indices correspond to the last trading day of the month. 
The sources are Capital International Perspective (Geneva, Switzerland, monthly 
issues). 
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This paper uses historical U.S. data to directly estimate the contri- 
bution of intergenerational transfers to aggregate capital accumula- 
tion. The evidence presented indicates that intergenerational trans- 
fers account for the vast majority of aggregate U.S. capital forma- 
tion; only a negligible fraction of actual capital accumulation can be 
traced to life-cycle or “hump” savings. A major difference between 
this study and previous investigations of this issue is the use of more 
accurate longitudinal age-earnings and age-consumption profiles. 
These profiles are simply too flat to generate substantial life-cycle 
savings. This paper suggests the importance of and need for sub- 
stantially greater research and data collection on intergenerational 
transfers. Life-cycle models of savings that emphasize savings for 
retirement as the dominant form of capital accumulation should 
give way to models that illuminate the determinants of intergenera- 
tional transfers. 


During the past quarter century, the life-cycle hypothesis (Modig- 
liani and Brumberg 1954; Ando and Modigliani 1963) has emerged 
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as the principal theory of aggregate saving in the American economy. 
It has stimulated a vast quantity of research testing and exploring its 
implications. Despite this effort, the importance of lifetime consump- 
tion smoothing to the process of capital accumulation remains unre- 
solved.' This paper presents evidence that the pure life-cycle compo- 
nent of aggregate U.S. savings is very small. American capital 
accumulation results primarily from intergenerational transfers. 
Distinguishing the roles of life-cycle savings and intergenerational 
transfers in the capital accumulation process is crucial to a number of 
economic issues. Economic models that incorporate intergenerational 
transfers can generate strikingly different results for a number of 
major economic questions, including the burden of the national 
debt, the impact of social security on savings, the incidence of tax- 
ation, the perpetuation of the inequality of wealth, and the optimal 
structuring of taxes to promote economic growth (see, e.g., Dia- 
mond 1970; Atkinson 1971; Feldstein 1974; Oulton 1976; Calvo, 
Kotlikoff, and Rodriguez 1979). 

This paper has two objectives. The first is simply to answer an 
accounting question, namely, can life-cycle savings alone account for 
the U.S. capital stock? The second goal is to answer an economic 
question: If, ceteris paribus, there were no intergenerational trans- 
fers, how large would the U.S. capital stock be? That is, if all such 
transfers were taxed in a confiscatory way, by how much would capital 
accumulation be reduced? We find that life-cycle savings cannot ex- 
plain the capital stock in an accounting sense and that in the absence 
of intergenerational transfers the U.S. capital stock would be substan- 
tially smaller. 

Our findings are based on a new methodological approach.’ In 
particular, a variety of historical U.S. data detailing population, labor 
earnings, consumption, and government taxes and transfers are used 
to directly estimate the shapes of historic age-earnings and age- 
consumption profiles. These profiles are combined with data on rates 
of return to calculate a stock of life-cycle wealth. This stock of life- 
cycle wealth is compared with aggregate wealth holdings in the 
United States. If there were no intergenerational transfers, the stock 
of life-cycle wealth would exactly equal total U.S. wealth. When in- 
tergenerational transfers occur, these two stocks differ by an amount 
equal to the stock of net received transfers. Hence, comparing total 
wealth with life-cycle wealth indicates whether the life-cycle model 


1 Tobin (1967) and Boskin (1978) argue that life-cycle savings are the predominant 
form of savings in the United States. White (1978) and Darby (1979) argue otherwise. 
Tobin and Dalde (1971) provide a somewhat intermediate view. 

2 Our methodological approach was influenced by and is similar to those of Brittain 


(1978) and Darby (1979). 
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alone can explain aggregate U.S. capital formation. We find that 
lifetime consumption and earnings profiles do not exhibit the kind of 
shapes needed to generate substantial life-cycle savings. Log linear 
approximations to these profiles suggest that growth rates of real 
earnings slightly exceed growth rates of real consumption over the 
lifetime. Since the life-cycle theory strongly relies on a lifetime growth 
rate of consumption in excess of the growth rate of earnings, the 
life-cycle theory of savings with no intergenerational transfers is a 
very poor description of the process of capital accumulation in the 
U.S. economy. 

Our findings are in agreement with recent studies by White (1978) 
and Darby (1979) and rationalize other stylized facts about the U.S. 
economy at odds with the life-cycle theory. Mirer presents evidence 
from social security data that “the aged do not run down their wealth 
during their lifetime” (1979, p. 442). Indeed, after he adjusts for 
intercohort differences in wealth at retirement, Mirer finds “that 
wealth clearly increases with age” (1979, p. 442). Darby points out that, 
although the ratio of expected retirement years to expected life span 
increased by 67 percent from 1890 to 1930, aggregate savings rates 
showed no increase during this period as would be predicted by the 
life-cycle theory (1979, pp. 22-28). Atkinson (1971) and Oulton 
(1976) construct life-cycle models to determine how much of observed 
British inequality of wealth may be explained by this theory. The 
answer is, very little. After taking into account inequality in age- 
earnings profiles and realized rates of return, Oulton concludes, “The 
results indicate that none of these factors, either singly or in combina- 
tion, are capable of accounting for a substantial proportion of actual 
wealth inequality” (1976, p. 99). 

Section I presents a theoretical framework for considering the 
importance of intergenerational transfers to aggregate capital ac- 
cumulation. Section II discusses the procedure to estimate the stock of 
life-cycle wealth. The data are described in Section III. Section IV 
presents and intuitively motivates the findings. The sensitivity of the 
findings to reasonable possible errors in the data and estimation 
procedure is also considered. The fifth section attempts to account for 
the residual between total U.S. wealth and the stock of life-cycle 
wealth using the limited information available on actual intergenera- 
tional transfers. The sixth and final section of the paper presents the 
conclusions and discusses some of their implications. 


I. Life-Cycle and Transfer Wealth—a Theoretical Framework 


The division of the stock of wealth, W, into life-cycle and transfer 
components, L and T, respectively, is easily understood by consider- 
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ing a 3-period model of economic growth with identical individuals in 
each age cohort. At a point in time, aggregate wealth in the economy 
is the sum of individual wealth holdings. Let W; stand for the wealth 
holdings of a representative individual of age i, and let P; stand for 
the population of individuals age 7; then 


W =W,P, + W2P2 + W3P3. (1) 


The W;’s equal, by definition, accumulated flows of net received 
transfers plus accumulated flows of earnings net of government taxes 
less government transfers less accumulated flows of consumption, 
where accumulation takes place at the after-tax interest rate r. Let Tf; 
be the net transfer received at age 7 from individuals age 7 for indi- 
viduals who are currently age k. For example, T?, represents the net 
transfer (which may be positive or negative) that a current 3-period- 
old individual received at the time he or she was 1 period old from 
individuals who were then 2 periods old. Let ef and c¥ represent the 
earnings and consumption of k-period-old individuals when they 
were 7 periods old. Assuming that earnings, consumption, and trans- 
fers occur at the end of each period, W,, the wealth of age 1 individu- 
als, equals zero. With this notation, W, and W, may be expressed as: 


W. = (Ti, + T7,)01 + 7) + (4 — cf) +7) 
Wate Teer he (Tt Tee) (leer) (2) 
Fal utete let Tye Ce ae) er), 
Combining (1) and (2) yields: 
W=T+HL, (3) 


muere T =(Te + Till + Ppt i(Tis + Tig) + ry + (Th + Th) 
Vie ie wand bo—s(ez ce rs + bed — 07) + 1)? + Fi eis (A) 
(e3 — c3)]P3. 

Equation (3) is the fundamental accounting relationship analyzed in 
this paper. Transfer wealth, T, corresponds to accumulated net re- 
ceived transfers, while life-cycle wealth, L, corresponds to accumu- 
lated earnings less accumulated consumption. Clearly, in a world with 
no intergenerational transfers, T would equal zero and W would equal 
L. Our first goal is to establish the relative magnitudes of the two 
components T and L and thereby determine whether U.S. wealth 
holdings can be predominantly explained by life-cycle savings. Since 
substantially less information is available about the values of the T*, 
most of this paper is devoted to calculating the value of L. Section V 
does, however, attempt to estimate T directly using fragmentary data 
and invoking steady-state assumptions. 

If the economy is in a steady state, net intergenerational transfers 
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received at a given age are constant through time, so that: 


li Tis = T 3. 

T13 a Tis = Tis (4) 
oe Thy 

T23 = T33. 


Under the assumption that population grows at a constant rate n, 
Pr=P.1 ny and Py= Pol £7). (5) 


Since transfers received by age group z from age group equal the 
negative of group j’s transfers from age group 2, 


Lig eg (6) 
and 

TyP, = —T oP. (7) 
From (4), (5), (6), and (7), T may be expressed in the steady state as: 


fe ey (teary 
Tet Tel yeaa he 
(leery (lit ®) 
ee a 


Under the assumption that r = n, T equals the yearly net flow of 
transfers from old to young cohorts, ¢, multiplied by the transfer- 
weighted age gap, g, between donors and recipients: 


f= (Pi (his + 143). Polos) = as a aa SS SD) 


If r exceeds (is less than) n, accumulated transfer wealth will exceed 
(be less than) the annual flow of transfers times the weighted average 
age gap. Equations (8) and (9) show that the contribution of transfers 
to the total stock of wealth depends critically on both the volume of 
the annual flow of transfers and the age span of transfers. Equation 
(8) is invoked in Section V to directly estimate the size of T. 

The second goal of this paper is to ask the economic question, How 
would the U.S. capital stock, W, change if, because of changes in taxes 
or tastes, intergenerational transfer wealth, T, was reduced? Since 
consumption, leisure, and transfer decisions are jointly determined it 
is important to examine the possibility that changes in transfers might 


* By examining (8), one can see that T corresponds simply to the accumulated 


holdings of transfers received by people who are currently alive from people who are 
currently dead. 
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induce or be associated with changes in consumption and earnings 
paths and thus alter the life-cycle component of wealth. In addition to 
recognizing interdependencies between transfer wealth and life-cycle 
wealth, the effect of changes in transfer behavior on capital formation 
may be examined in both partial and general equilibrium contexts. 
The partial equilibrium change in W resulting from a reduction in T 
holds wage rates and interest rates constant and corresponds to a shift 
in the household supply curve of capital. This paper considers only 
the steady-state partial equilibrium impact of changes in transfers on 
the capital stock. A calculation of the general equilibrium effect re- 
quires taking account of the responsiveness of the level and shapes of 
earnings, consumption, and lifetime transfer profiles to changes in 
capital accumulation.* 


The Steady-State Partial Equilibrium Reduction in Capital Intensity Arising 
from a Reduction in Intergenerational Transfers 


In order to analyze the partial equilibrium reduction in the stock of 
wealth, we first note that life-cycle wealth, L, equals accumulated 
earnings minus accumulated consumption. Accumulated earnings, in 
turn, equals accumulated wages at full-time work minus the accumu- 
lated value of leisure. Letting C stand for accumulated consumption, 
S for accumulated full-time wages, and M for the accumulated value 
of leisure, equation (3) becomes: 


W=T+S—-M—=C. (3) 


The partial equilibrium percentage reduction in W for a percentage 
change in transfer wealth, 7, equals: 


ee ee is 


or WW, OT) W wy, 


Olen Or 
Equation (10) states that the proportionate reduction in the capital 
stock equals the share of transfer wealth in aggregate wealth, T/W, 
minus two additional terms indicating how accumulated earnings and 
accumulated consumption respond to changes in transfers. 

Our analysis of the response terms 0M/0T and 0C/0T assumes that 
the utility of consumption and leisure is separable from the utility 
derived from intergenerational transfers. This assumption implies 
that the marginal rates of substitution between consumption and 
leisure at different points in time are independent of the level of 


4 Kotlikoff (1979) demonstrates that general equilibrium changes in capital intensity 
can be substantially smaller than partial equilibrium changes. __ 

> Because wage rates are held constant in this partial equilibrium analysis, 0S/dT 
equals zero. 
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intergenerational transfers. Two examples of utility functions exhib- 
iting this property are: 
Uy = log Cy + log C, + log C, + log J, 

+ log l, + log 1; + a log T,, + a log Ty3 (11) 

+ a log Tz; + a log T23 + a log T3,; + a log Tse 
and 

Uo = log Cy + log C, + log C, + log J, (11) 

+ log J, + log l; + aU,. 


The term Up stands for the utility of a representative individual of 
generation zero. The terms 1,, 2, and /; correspond to leisure in 
different periods. In (11) the individual derives utility directly from 
the level of net transfers. In (11') the individual derives utility from 
the utility, U,, of his descendants. This is an example of an “overlap- 
ping utility function.” For both types of separable utility functions the 
marginal rates of substitution between consumption and leisure are 
independent of the level of a, the preference parameter influencing 
the size of transfers. In addition, the first-order conditions for the 
optimal choice of consumption and leisure involve the equality be- 
tween these marginal rates of substitution and the relative prices of 
consumption and leisure at different ages. Hence, neither changes in 
transfer preferences nor taxes on intergenerational transfers that 
affect only the price of transfers but not the price of consumption or 
leisure will alter the first-order conditions. 

Given the levels of transfers, the utility-maximizing levels of con- 
sumption and leisure can be separately derived from these first-order 
conditions and the lifetime budget constraint given below: 





€ G M M 
Ce 3 —+_ +3 __ = 
Lap Libris oa (Dictex i eek ipa a IRS a ina 
S2 Ss To T 3 
TEP rye bee eel ee ra ere) 
Ts, T 39 
(leet) (Lista riry 


In equation (11), M,, Mz, and M; are the values of leisure in periods 
1, 2, and 3 (by value of leisure we mean the number of units of leisure 
times the price [wage] per unit). The terms s,, s., and $3 are full-time 
wages in periods 1, 2, and 3. Equation (12) indicates that lifetime 
consumption and leisure are financed by full-time lifetime earnings 
plus net lifetime received transfers. The separability assumption im- 
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plies that changes in preferences or taxes that alter the values of the 
Ti; have only an income effect on the choices of consumption and 
leisure over the lifetime. 

We now demonstrate that this income effect of a change in trans- 
fers is identically zero for the case that r equals n.6 Using (5) and (6) to 
rewrite (12), we obtain: 


C C M M 
C, + —— + — 3 4 + — 4 = 
eRe ee late (haa 
Sg 53 T,2(r =e) T23(r — n) Fi 
a aa a le yi 1 UML =A 
TES aSa ees fe re Wee PPP ee) 
pe ella ter (stan) 
ary 


We can also write the budget constraint (12') in terms of aggregate 
transfer wealth per young person T’ where T’ = T/P,: 


C C o M M 
G + 2 + 3 2 3 
Siem sica. et siger) | lr? , 
(12") 
4 So $3 Gait) (lar n)er 
= ee ee ee ae Te ee A es 
eile ci eed Lot)? dtr 


As is clear from (12’) and (12”), when r = n, lifetime consumption 
and leisure are financed solely out of lifetime full earnings; all of the 
accrued interest on received transfers, as well as the principle itself, is 
used to maintain steady-state transfers per head at a constant level. 
When r = n, reducing transfers has no impact on the steady-state 
budget constraint, and the terms 0M/0T and 0C/0T in (10) are zero 
(i.e., since lifetime consumption and earnings paths stay the same 
when r = n, steady-state life-cycle wealth, L, will be unaltered by 
changes in steady-state transfer wealth, T). Thus, any decline in 
transfer wealth, T, will reduce total wealth, W, dollar for dollar in 
partial equilibrium when r = n. 

In the case that r differs from n, steady-state changes in the level 
and pattern of transfers (the 7;; terms) affect the steady-state budget 
constraint only insofar as they alter the stock of transfer wealth T. 
This is a general proposition that can easily be shown to hold inde- 
pendent of the number of periods in the model. When r exceeds n, 
part of lifetime consumption and leisure is financed by lifetime trans- 
fers. Again, under the separability assumption, the reduction in trans- 


6 In the case of a tax on transfers, the income effect of a change in transfers when r = 
n will be zero only for the case of a compensated tax on transfers. Compensated rather 
than uncompensated tax changes are the appropriate focus of studies of government 
tax policies toward savings. See Diamond 1970 and Kotlikoff and Summers 1979. 
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fers has simply an income effect on consumption and leisure and, 
assuming both are normal goods, will reduce lifetime consumption 
and raise lifetime earnings. In (10) both 0M/0T and 0C/0T will be 
positive when r exceeds n. The reverse will be true when n exceeds r. 
It is easy to demonstrate that introducing labor-augmenting produc- 
tivity growth simply requires relabeling n everywhere as the popula- 
tion growth rate plus the productivity growth rate. In the case of 
productivity growth, the steady state is characterized by a constant 
level of transfers per effective worker. 

The value of these terms depends on the particular preferences 
determining the levels and shapes of consumption and leisure paths. 
As one example, table 1 gives the value of the transfer wealth- 
adjustment factor [1 — (@M/dT) — (OC/dT)] for a particular spec- 
ification of preferences for consumption and leisure and for dif- 
ferent values of r and n. The example assumes a 55-year life span 
and uses the logarithmic utility function with time-preference 
parameter p and leisure-preference parameter a:7 


D D 
U -|[ log Cye-P'dt + a] log edt. (13) 
0 0 


The /, terms in (13) are units of leisure at different points in time. The 
calculations underlying table 1 are presented in Kotlikoff and Sum- 
mers (1980, appendix A). 

The figures in table 1 indicate that the response of consumption 
and labor supply can be quite important in determining the final 
partial equilibrium effect of a reduction in transfers when the interest 
rate differs from the economy’s growth rate. For the specific 
logarithmic function chosen here, a real interest rate that is 1 percent 
higher than the rate of population plus productivity growth generates 
total wealth response numbers approximately equal to 0.7. If, on the 
other hand, the rate of population plus productivity growth exceeds 
the real interest rate by 1 percent, reducing transfer wealth by $1.00 
reduces aggregate wealth by $1.40. 

For the United States the annual rate of population growth has 
averaged 1.40 percent from 1900 through 1974, the period of this 
study (see U.S. Department of Commerce 1975, pt. 1, p. 8). Produc- 
tivity growth as measured by the annual percentage change in real 
GNP per man-hour has averaged 2.20 percent over this period. 


’ The utility function in (13) corresponds only to that part of total utility pertaining to 
consumption and leisure. Any set of preferences for transfers may be added to (13) as 
long as utility of consumption and leisure is separable from the utility of transfers. The 
assumption of a steady state with productivity growth also requires that the entire utility 
function be homothetic. 

® U.S. Department of Commerce 1973, series A167 and A168, Pae2lOwandsWES: 
Department of Labor 1978, p. 336. Kendrick’s series of output per man-hour is used 
for the years 1900-1910, and the BLS series thereafter. 
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TABLE 1 


‘TRANSFER WEALTH ADJUSTMENT FACTOR 








Le ae 
és 3 r Oceana Ty, 
02 01 02 73 
02 01 03 wil 
02 02 02 1.00 
02 03 02 1.40 
02 04 02 2.00 
03 01 01 58 
03 01 05 48 
03 02 03 71 
03 03 03 1.00 
03 04 03 1.43 
04 01 01 45 
04 01 07 30 
04 02 04 49 
04 03 04 .70 
04 04 04 1.00 





Adding 1.4 to 2.2 yields 3.6 percent as the rate of population plus 
productivity growth. This paper calculates a portfolio-weighted net 
nominal rate of return for the U.S. economy from 1900 through 
1974. After subtracting inflation, the real annual net after-tax rate of 
return in the U.S. economy averaged 4.5 percent between 1900 and 
1974.” 

The fact that real interest rates have on average exceeded growth 
rates suggests that part of U.S. consumption and leisure may well have 
been financed by interest earned on intergenerational transfers. 
Hence, a dollar reduction in transfer wealth can be expected to 
reduce aggregate wealth by less than $1.00. The calculated 1 percent 
gap between r and n suggests that eliminating transfer wealth in the 
U.S. economy would reduce total wealth by about 70 percent of the 
amount of transfer wealth. While the 70 percent figure is meant to be 
suggestive rather than precise, it appears that almost any choice of 
preferences would yield offset factors not far from 70 percent.'? That 


® To obtain a real net rate of return series, we subtract the annual percentage change 
in the CPI from our net nominal return series indicated in U.S. Department of 
Commerce (1975, pt. 1, pp. 210-11, table B2) and U.S. Department of Commerce 
(1978, p. 490). 

‘© These calculations were performed for the more general isoelastic utility function. 
A wide range of parameter values for rates of risk aversion and time preference 
produced values of [1 — (0M/0T) — (8C/dT)] ranging from 0.6 to 0.75 when r exceeds 
n by 1 percent. While it is unclear what utility function is most appropriate, these 
parameter values cover a wide range of different shapes of consumption and leisure 
profiles and, we presume, bound the likely response. The assumption implicit in these 
utility functions that consumption and leisure respond homothetically to changes in 
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is, the life-cycle offset to a change in transfer wealth is likely to be 
small. 


II. The Estimation of Life-Cycle Wealth—Methodological 
Approach 


Total life-cycle wealth in the U.S. economy equals the sum over all 
living persons in the economy of life-cycle assets. Each person’s life- 
cycle assets correspond to his or her accumulated earnings less ac- 
cumulated consumption where accumulation occurs at the actually 
realized rates of return. If data existed detailing each person’s earn- 
ings, consumption, and realized rate of return on assets at each point 
in time in the past, it would be easy to check whether U.S. wealth 
holdings could be explained predominantly by life-cycle accumula- 
tion. 

Obviously, such detailed individual specific data are not available. 
However, historical data for the United States on aggregate earnings, 
aggregate consumption, rates of return, and age-earnings and age- 
consumption profiles may be used to carry out this life-cycle asset 
computation on a cohort-by-cohort basis. This paper treats individu- 
als of each sex within an age cohort as identical and estimates the 
average excess of after-tax earnings plus government transfers over 
consumption experienced by members of that age-sex cohort during 
each of their adult years in the past. These differences are then 
accumulated up to 1974 using historical net nominal interest rates. 
The total over all age-sex cohorts of these accumulated life-cycle 
assets is then compared with the 1974 value of total U.S. private net 
worth. 

Life-cycle wealth of the age-sex cohort that is age a and sex j in 
1974, L;(a), is given by 


L,(a)= > {(6(a,i) + G,(a, i) - C,(a, i] 


‘ (14) 
I] [1 +r(a, k)]}P;(a). 


transfers may be inappropriate. Even if the utility function is homothetic, capital 
market constraints precluding borrowing when young against future earnings may 
make the consumption and leisure response to a reduction in transfers nonhomothetic. 
These capital market constraints may not be binding in the presence of transfers 
because the excess of consumption over earnings when young is financed by the 
received transfers. Eliminating or reducing transfers could make these capital market 


constraints binding and require proportionately less consumption and leisure when 
young than when old. 
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In equation (13), P;(a) stands for the number of people alive in 
1974 who are age a and sex j. The terms E;(a, i), G;(a, i), and C;(a, i) 
are, respectively, the average after-tax earnings, government trans- 
fers, and consumption of the age a-sex j cohort at the time its 
members were age 7. Age 18 is taken as the age of adulthood. Con- 
sumption expenditures by adults on children under the age of 18 are 
considered to be a part of adult consumption rather than intergener- 
ational transfers.'' The term r(a, k) is the economy-wide annual net 
nominal interest rate received by the age a cohort during the year the 
cohort’s members were k years old. 

Our estimate of total life-cycle assets in the economy in 1974 equals 
the sum over all age-sex cohorts of the estimated values of L;(a).!2 The 
difference between life-cycle wealth and total U.S. wealth in 1974 is 
our estimate of transfer wealth. 


Calculation of Longitudinal Profiles of Net Earnings, Government Transfers, 
and Consumption 


The essential idea involved in these calculations is to use cross- 
sectional distribution profiles to allocate aggregate flows of net earn- 
ings, government transfers, and consumption to different age-sex 
cohorts in a given year. By performing this computation for each year 
from 1900 to 1974, we obtain the longitudinal profiles E;(a,7), Gj (a, 7), 
and C;(a, 7) as 7 varies. e 

To illustrate the computation for E;(a, 7), we define the following 
terms: é; is the average earnings of 40-year-old male workers in year ¢; 
&m(4, t) is the ratio of average earnings of male workers at age a in year 
t to average earnings of 40-year-old male workers in year ¢t; gy(q, t) is 
the ratio of average earnings of female workers at age a in year ¢ to 
average earnings of 40-year-old female workers in year ¢; ; is the 
ratio of average earnings of 40-year-old female workers to average 
earnings of 40-year-old male workers in year t; @m (a, t) is the percent- 
age of males age a with work experience in year ¢; a,(a, t) is the 
percentage of females age a with work experience in year t; H; is total 
after-tax labor income in year t; P,, (a, t) is the population of males age 
a in year t; and P,(a, t) is the population of females age a in year t. 
Given information on all of the above variables except ¢,, equation 
(14) is used to solve for e: 


"| Treating consumption of children under age 18 as their own life-cycle consump- 
tion would greatly reduce our estimate of life-cycle wealth and greatly increase our 
estimate of transfer wealth. The age of adulthood is somewhat arbitrary, but 18 appears 
to be a reasonable number and of general interest. 

12 Assets of children under age 18 are assumed to be completely inherited and are not 


included in life-cycle wealth. 


718 JOURNAL OF POLITICAL ECONOMY 


100 
Hy = & >. [gm(a, t)Om(a, t)Pm(4, t) 
a=18 (15) 
a Agr (a, t)as(a, t)P;(a, t)]. 


Given &, En(a, 7) for males and E;(a, 1) for females satisfy: 
En(a, 2) = Enla, Gy = (1974 Foal t)] = CrLm(a, t)Qm (a, d); (16) 
E,(a, i) = Eyla, a — (1974 — t)] = &Argz(a, thay(a, t). (17) 


The procedure for computing the longitudinal profiles of con- 
sumption, C;(a, 2), is identical to that just described for earnings; a 
cross-sectional profile of relative consumption by age and sex is used 
to distribute aggregate U.S. consumption in each year to different 
age-sex cohorts. 


Calculation of Net Nominal Interest Rates Series 


The interest rate term r(a, 7) on (13) depends only on the year ¢ since 
r(a,i) = r[1974 — (a — i)] =r(t). The computation is performed using 
two different interest rate time series. Series 1 was calculated by 
dividing U.S. wealth holdings into six separate assets, estimating a rate 
of return series for each asset, and then weighting each asset’s rate of 
return by its share in the U.S. portfolio for the particular year in 
question. 
Series 2 is based on the wealth augmentation relation: 


W e434 — W.1 aia Tr) a E, Ste G, any (Ge (18) 


Equation (18) indicates that private U.S. net worth in year t + 1 
equals U.S. net worth in year ¢ plus savings in year ¢. Savings in year t 
is, in turn, equal to net income on assets, 7;W;, plus after-tax labor 
income and government transfers, FE, and G,, less aggregate con- 
sumption in year ¢, C,. Equation (18) is solved for 7; using time-series 
data on W,, E;, G,, and C,. 


III. Description of Data Used to Calculate Life-Cycle Wealth 


This section describes sources and procedures for several of our data 
series. Kotlikoff and Summers (1980) provide a detailed description 
of all the data and adjustments to the data used in calculations. 
Information on compensation of employees after 1929 is obtained 
from the Nateonal Income Accounts and, for the years 1909-29, from 
Leven, Moulton, and Warburton (1934) and Kuznets (1941, 1946). 
Prior to 1909, employee compensation is imputed from Kendrick’s 
estimates of net national product using the ratios of employee com- 
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pensation to national product from 1909 to 1918.13 The imputation of 
labor income to the self-employed follows Christensen (1971). The 
number of proprietors is multiplied by the average earnings of full- 
time equivalent employees. The chief bias here is simply that the 
self-employed may earn more or less on average than employees. To 
insure that the results do not reflect a substantial underestimate of 
entrepreneurial earnings, the estimate of entrepreneurial earnings 
for each year is increased by 20 percent. 

Estimates of state and federal income taxes paid on labor income 
were obtained from IRS Statistics of Income and the National Income and 
Product Accounts. Combined employer and employee social security 
and health insurance taxes were obtained from the Social Security 
Annual Statistical Supplement for various years. 

Estimates of the age-earnings profiles for men and women were 
obtained by fitting separate regressions to social security estimates of 
median annual earnings of workers at different ages for the years 
1950 through 1975."* The general shapes of the profiles predicted by 
the regressions are quite similar throughout the 1950-75 period. For 
the years prior to 1950, the predicted male and female age-earnings 
profiles for the year 1955 were used.” 

The value of 4,;, the ratio of average earnings of 40-year-old female 
workers to average earnings of 40-year-old male workers in year t, was 
taken to be .55 throughout the period. This appears to be an upper- 
bound value for A; (see Kotlikoff and Summers 1980, p. 46, n. 35). 

Values for work experience rates by age and sex, @,(a, t) and a;(a, 
t), are available only after 1959. Substantially more information, espe- 
cially for the early 1900s, is available on labor force participation rates 
by age and sex. Regression analysis for the post-1959 period indicates 
that work experience rates can be predicted quite well by functions of 
age and labor force participation rates. This post-1959 regression was 
used to estimate the a function for each year from 1900 to 1974 based 
on labor force participation rates for the appropriate year. The labor 
force participation rates equal the values predicted from regressions 
of labor force participation rates on fifth-order age polynomials for 
each sex and for different census years. 

While cross-sectional distribution functions by sex and age were 
computed for social security and medicare transfers, we assumed that 


13 See Leven et al. 1934, p. 155; Kuznets 1941, pp. 322-23; and Kendrick’s series A6 
and B61 in U.S. Department of Commerce (1973), pp. 182, 222. Because of limitations 
on population estimates on individuals above age 85, we treat all individuals who were 
above age 85 in 1974 as if they were age 85. Since 85-year-olds in 1974 did not reach the 
age 18 until 1907, we really do not use any data for the years prior to 1907 in this paper. 

14 These and other regression results are reported in Kotlikoff and Summers (1980). 

1° The year 1955 was chosen because by 1955 the social security data cover most of 
the private economy’s work force. 
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other transfers, including veterans’ benefits and welfare payments, 
were distributed in the cross-section over the years 1900-1974 ac- 
cording to that year’s age-earnings distributions. 

Total consumption expenditure, Z;, is taken from National Income 
Accounts after 1929 and from Kendrick prior to 1929. Ideally one 
should subtract expenditures on consumer durables and add imputed 
rent on consumer durables to obtain true economic consumption. 
The difficulties of implementing this for the pre-1929 years led us to 
simply use the consumer expenditure series. In any case, there ap- 
pears to be very little difference between the consumer expenditure 
series and the true economic consumption series."® 

The cross-sectional consumption distribution functions were ob- 
tained from the 1960 and 1972—73 Consumer Expenditure Survey (CES) 
tapes. Unfortunately, similar data are not available to generate these 
profiles for earlier years. Life-cycle wealth is computed using the 
1972-73 cross-sectional profiles throughout the period 1900-1974. 
The 1960 profiles generated essentially the same level of life-cycle 
wealth. 

In distributing total household consumption to household mem- 
bers, household heads and their spouses are assumed to consume 
equally; all other household members, including children, were allo- 
cated 50 percent of the household head’s consumption. The total 
consumption of children under the age of 18 was then reallocated to 
the household head and spouse, assigning each one half of children’s 
consumption in the case of two spouses, or giving all the children’s 
consumption to the household head if he or she was single. The 
general shape of the profiles was quite insensitive to the assumption 
that other household members and children consume only 50 percent 
of the consumption level of the household head. The 50 percent 
assumption generated slightly more life-cycle wealth and is the one 
used. 

Net nominal interest rate series 1 is generated from data on histori- 
cal rates of return and Goldsmith and Lipsey’s (1963) data on 
portfolio shares. From Goldsmith and Lipsey’s balance sheets we 
constructed seven asset categories plus liabilities. These are tangible, 
noncorporate business assets including land and structures, residen- 
tial land and structures, money, short-term claims (savings accounts 
and U.S. Treasury bills), corporate stock, long-term corporate bonds, 
and U.S. savings bonds. A rate of return series was calculated for each 
asset as well as the liabilities. A weighted rate of return was computed 


‘6 Christensen and Jorgenson’s (1973) “true” consumption series (p. 17) is very close 
to the NIA series for the years 1929-69. The Christensen and Jorgenson series is 
slightly higher for most years; using the Christensen and Jorgenson series for the 
post-1929 years would, therefore, lead us to calculate less life-cycle wealth. 
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taking the share of each item in net worth during the period consid- 
ered as the weight. The nominal rates of return were converted to 
after-tax nominal rates of return using average income tax rates for 
each year. The series 2 net nominal interest rates were computed for 
the post-1929 period using equation (22). Reliable net worth data are 
not available until 1929. In the computation the series 2 rates from 
the pre-1929 years are filled in with the series 1 values. 


IV. The Size of Life-Cycle Wealth 


At the end of 1974 total net worth in the U.S. economy equaled 
$4.154 trillion. Of the $4.154 trillion, $134 billion represent the 
tangible assets of nonprofit institutions (see FRB-MIT balance sheets); 
in 1974 nonprofit institutions owned 4.4 percent of corporate equities 
(SEC 1977, p. 11). Under the assumption that this percentage applies 
to all other financial assets, total net worth of nonprofit institutions in 
1974 would equal $270 billion.” After subtracting $270 billion from 
$4.154 trillion, the 1974 net worth figure for the noninstitutional 
household sector is $3.884 trillion. This is the number to compare 
with the life-cycle wealth calculations. 

In addition to presenting total life-cycle wealth of individuals, a 
second life-cycle wealth concept, LCW2, is calculated that adds to 
LCW1 an upper-bound estimate of accumulated interspousal trans- 
fers. Many economists would include within their definition of life- 
cycle wealth the amount of accumulation that arises from interspousal 
transfers. Interspousal transfers give rise to wealth accumulation, 
first, because wives are on average younger than husbands and, sec- 
ond, because females live longer than males. Since it is conceptually 
difficult, if not impossible, to exactly trace individual patterns of 
household formation that involve marriage, divorce, and death, an 
upper-bound estimate of the stock of interspousal accumulated trans- 
fers is added to LCW1. To be specific, at age 21 all males are assumed 
to marry 18-year-old females.’* If either spouse dies prior to reaching 


17 See SEC 1977, p. 11. This assumes that nonprofit institutions have no liabilities. 

18 The 1972-73 CES data indicate that the average age gap between husband and 
wife is 3 years. The number of deaths in an age-sex cohort in a particular year was 
obtained by multiplying age-, sex-, and year-specific death rates from the U.S. census 
bureau’s population estimates. Historical Statistics (U.S. Department of Commerce 1975, 
vol. 1, death rates, series B 181-92, pp. 61-62) are smoothed. Even in the absence of 
marriage, our LCW2 concept would be of interest because it essentially treats individu- 
als as if their wealth at each moment were fully annuitized; i.e., given the uncertainty of 
the date of death, individuals in a life-cycle model with no bequest motive would always 
purchase annuities. Hence, our procedure of essentially passing a cohort’s bequest over 
to surviving members mimics the operation of an annuities market. Thus LCW2 can be 
thought of as essentially the life-cycle wealth of the U.S. economy if individuals always 
purchased annuities. 
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age 75, all of his or her individual life-cycle net worth is assumed to be 
transferred to the surviving spouse; it was furthermore assumed that 
the surviving spouse does not die prior to age 75. Thus, for example, 
if a 40-year-old male dies in 1960 with $20,000 of life-cycle wealth, 
this $20,000 is transferred to a 37-year-old female who is assumed still 
to be alive 14 years later in 1974. The $20,000 of received transfers is 
accumulated up to 1974 at the prevailing interest rates and included 
in life-cycle wealth. 

This procedure overestimates accumulated interspousal transfers 
for three reasons. First, not all transfers of decedents go to surviving 
spouses or even to surviving relatives in the same age cohort. Second, 
not all surviving spouses will, themselves, live until age 75. Rather, 
many will die much earlier, leaving the bulk of their residual wealth to 
children or grandchildren. Third, some decedents die without ever 
having married, leaving their estates to younger cohorts. 

Table 2 reports the life-cycle wealth numbers, LCW1 and LCW2, 
based on interest rate series 1 and series 2, as well as constant interest 
rates of 2, 4, 6, 8, and 9 percent. The series 1 values for LCW1 and 
LCW2 are, respectively, —$1,036 and $502 billion. The series 2 
figures are — $1,229 and $733 billion. These figures are strikingly 
small; they are even more surprising in light of assumptions made 
that bias the calculation toward more life-cycle wealth.'® Under the 
stricter definition of life-cycle wealth, LCW1, life-cycle accumulated 
hump savings is a large, negative number. The allowance for inter- 
spousal transfers yields positive, but very small, estimates of hump 
savings. The $733 billion series 2 LCW2 figure for life-cycle wealth 
represents only 19 percent of the total 1974 U.S. wealth. These small 
values for life-cycle wealth do not reflect the choice of interest rates; 
no historically reasonable constant nominal interest rate will yield 
significant positive life-cycle wealth. Accumulated intergenerational 
transfers appear to represent the bulk of the $3,884 billion of U.S. 
wealth holdings in 1974. Subtraction of 733 from 3,884 gives $3,151 
billion as the estimate of the 1974 stock of transfer wealth. Taking the 
.7 (r exceeds n by about 1 percent) adjustment factor as illustrative, 
entirely eliminating intergenerational transfers would reduce U.S. 
wealth by about $2.2 trillion in the context of a partial equilibrium, 
long-run steady-state model. 

To explain the large stock of U.S. wealth, the life-cycle theory of 
savings must rely on a substantial excess of earnings Over consump- 
tion when young followed by an excess of consumption over earnings 


'’ These include increasing the labor income of the self-employed by 20 percent, 
using a high value of .55 for the ratio of female to male earnings at age 40, assuming 


that the age-consumption profile is flat after age 75, and assuming that all earnings 
after age 75 are zero. 
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Fic. 1.—Sum of male and female longitudinal average earnings and average consump- 
tion profiles, age 18 in 1910—age 82 in 1974. 


when old. The historical reality for the United States is simply that 
longitudinal earnings and consumption profiles have not exhibited 
the kinds of shapes required for substantial life-cycle accumulation. 
For males, earnings profiles greatly exceed consumption profiles over 
most of the life cycle. For females, however, the Opposite is true. 
Life-cycle wealth starts out negative and remains negative over all 
ages. Figures | and 2 graph longitudinal profiles of the sum of male 
and female average earnings plus government transfers and average 
consumption for two different age cohorts. The profiles are pre- 
sented in real 1967 dollars. The diagrams clearly show that the male 
excess of earnings over consumption is essentially offset by the female 
excess of consumption over earnings. 

Contrary to the life-cycle simulation studies that have generated 
substantial life-cycle wealth, the actual growth rate of lifetime con- 
sumption does not substantially exceed the actual growth rate in 
lifetime earnings.”” For example, the male age cohort that reached 


*° Roger Gordon has suggested to us that observed age-earnings profiles may differ 
from true age-earnings profiles to the extent that firms and workers are engaged in 
implicit contractual arrangements. Firms could pay out to workers less (more) than they 
truly earn when young and more (less) than they truly earn when old. Some part of a 
worker's life-cycle saving or dissaving would then be accomplished within the firm and 
would correspond to a claim (or liability) attached to the firm’s assets. While we would 
strongly contest the empirical validity of this Proposition, certainly a very upper-bound 
estimate of life-cycle savings with firms would be the difference between the market and 
replacement costs of capital in the corporate sector. Using 0.819, the average value of q 
over the period 1952-74 (von Furstenberg 1977) and the 1974 replacement value of 


corporate capital of $1.679 trillion gives an upper-bound estimate of $304 billion for 
this effect. 
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Fic. 2.—Sum of male and female longitudinal average earnings and average con- 
sumption profiles, age 18 in 1940-age 52 in 1974. 


age 18 in 1920 experienced a growth rate of 2.93 percent in real 
earnings between ages 18 and 65. In comparison, the male growth in 
real consumption between ages 18 and 72 was only 2.32 percent. For 
the corresponding female cohort, earnings growth was 2.23 percent 
and consumption growth only 1.72 percent. Real consumption pro- 
files are fairly flat, and real earnings profiles peak in late middle age, 
not at early ages. Both of these factors militate against the life-cycle 
theory of savings as an explanation of U.S. wealth holdings. 

The large transfer wealth value of $3.151 trillion cannot simply be 
explained as private transfers from old to young offsetting forced 
government social security and medicare transfers from young to old. 
Under the Barro (1974) view, introducing unfunded social security 
into an economy will not lead to a change in consumption but will 
increase private net transfers from old to young. If Barro’s view is 
correct, it is easy to determine how large private transfers would have 
been with no social security and medicare system by simply setting all 
historic social security and medicare benefits and taxes to zero in the 
calculation. This procedure led to a series 1 value of $775 billion anda 
series 2 value of $1,094 billion for LGW2. Hence, even if Barro is 
correct in his view about unfunded social security, transfer wealth 
would still have exceeded $2.79 trillion in 1974 if there had never 
been a social security program. 

One possible problem with the estimates is that the age- 
consumption profiles prior to 1960 may have looked substantially 
different from those after 1960. To explore this issue, relative con- 
sumption prior to age 40 was reduced by 10 percent and relative 
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consumption after age 40 was increased by 10 percent. Thus, 60- 
year-olds were effectively assigned an extra 20 percent consumption 
relative to 30-year-olds in every year from 1900 to 1974. Altering 
consumption profiles in this manner produced values of —193 for 
LCW1 and 1,702 for LCW2 using series 1 and —169 for LCW1 and 
2,178 for LCW2 using series 2. These numbers are still substantially 
smaller than the $3,884 billion of 1974 U.S. net worth. 

Another issue of concern is whether our data series on aggregate 
flows accurately describe U.S. experience in the 1900s. As mentioned, 
to insure against underestimation of the labor income of the self- 
employed, we have already increased our estimates by 20 percent. 
Raising this add-on factor to 30 percent increases the series 1 values of 
LCW1 to —679 and LCW2 to 872. It is worth noting that eliminating 
any self-employment add-on factor lowers LCW1 to —1,739 and 
LCW2 to —714. 

A final data issue concerns consumption expenditures of nonprofit 
institutions as well as contributions of individuals to such institutions. 
Ideally we should subtract institutional consumption from aggregate 
consumption and add contributions to these institutions to arrive at 
total household consumption. While not all the data needed for this 
adjustment are available, what data are available suggest that the 
adjustment would be very small and would slightly lower our estimate 
of life-cycle wealth.”! 

The LCW2 wealth concept effectively deals with possible bias aris- 
ing from differential survival probabilities between the rich and the 
poor. This calculation assumes that at least one member of each 
household survives to age 75 and attributes all household life-cycle 
accumulation to the surviving spouse(s). Hence, this procedure as- 
sumes that rich and poor households have identical survival prob- 
abilities, which eliminates this issue of bias. 

Another type of aggregation error could arise if some households 
continually received higher rates of return on their assets than other 
households. In order to investigate this issue one would need detailed 
knowledge of the joint distribution of rates of return and household 
consumption and earnings patterns. Unfortunately, this information 
cannot be obtained from existing micro data, and this fact precludes a 
reasonable assessment of the magnitude of this source of error. 


*! In 1970, for example, the level of philanthropic payments of individuals totaled 
$16.09 billion (U.S. Department of Commerce 1975, vol. 1, series H 399-400, p. 359). 
In comparison, the NIA (U.S. Department of Commerce 1976, p. 90, lines 101 and 102) 
reported 1970 expenditures of institutions for religious, educational, and welfare 
activities totaled only $11.32 billion. 
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V. Explaining the Residual—the Stock of Transfer Wealth 


This section investigates the extent to which U.S. data on intergener- 
ational transfers can explain the large residual between total net 
worth and the estimates of life-cycle wealth. Unfortunately, there are 
very few data detailing nontaxed intergenerational transfers. These 
nonenumerated transfers take many forms. Parents who lend money 
at below market rates to their children for a down payment on a house 
or a business are engaged in an intergenerational transfer. A father 
who makes his son a full partner in a lucrative business can effectively 
transfer large sums of money with no tax liability to the IRS. Parents 
who fully or partially support their children through college or after 
college are making transfers. A grandmother’s gift of her expensive 
wedding china and rings to her granddaughter is an intergenera- 
tional transfer. Transfers in these forms, as well as outright monetary 
gifts, are very rarely reported. 

To obtain some idea about how large aggregate intergenerational 
transfer flows need to be in order to explain a $3.151 trillion stock of 
transfer wealth, consider the multiperiod analogue to equation (8): 


e r—n)D 


(7 — n) 


Equation (19) is a simplified expression for the steady-state stock of 
transfers; the formula assumes that all transfers are given at age G 
and received at age. The yearly flow of intergenerational transfers 1s 
t, the age of death D, andr and n are rates of interest and population 
plus productivity growth, respectively. In the case r = n, this expres- 
sion reduces to T = t(G — I), the analogue to equation (9). Table 3 
evaluates the age gap factor, that is, the terms multiplying ¢ for 
various parameter values and taking D to be 55.” 

Exactly which age gap is appropriate is unclear.” An age gap of 30 
allows some significant transfers to grandchildren as well as children. 
The age gap factor in this case for a .01 excess of r over n is 45. Hence, 
to explain $3,151 billion in transfer wealth, the yearly flow of trans- 
fers would have to equal 3,151 divided by 45, or $70 billion. A 2 
percent differential between (r — n), again assuming a 30-year gap, 
would require only a $46 billion annual transfer flow. 

The 1962 Federal Reserve Survey of Consumer Finances was used 


T=t [1 = ee (19) 


22 If we take age 18 as the age of adulthood, the value of 55 for D corresponds to a 


real-world age of death of 73. 4 
23 To precisely calculate a weighted age gap we would need data detailing age of 
donors and recipients as well as the size of the transfers. Such data are currently 


unavailable. 
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TABEER 3 


AGE GaP FACTOR 


Teh 
Ace Gar (G— I) —.02 01 0 01 02 03 
SORE Tae eT 7 8 13 20 31 49 78 
OF rt eek ea ll 16 25 38 59 92 
SO peters oo ee 14 20 30 45 68 103 
Soa ee Ae oe 17 24 35 51 76 113 





to estimate the bequest portion of the yearly transfer flow. 
Specifically, the distribution of net worth holdings by age, sex, and 
marital status was calculated. Next, 1962 mortality probabilities were 
applied to this distribution to arrive at an estimated distribution of 
bequests by age, sex, and marital status. An additional adjustment to 
these figures was made in order to reconcile the U.S. aggregate net 
worth value estimated from the Federal Reserve tape with the FRB- 
MIT net worth data. 

Unpublished data collected by Paul Menchik and Martin David 
from the Washington, D.C., Inheritance Tax File for the year 1967 
reveal that males who were married at the time of their death left 10 
percent of their estate to their children. Married female decedents left 
19 percent of their estates to their children. The proportion left to 
grandchildren and other young relatives is unclear, but it probably 
does not exceed 2 percent for males and 4 percent for females, which 
represents half of the percentage contribution to other relatives. For 
single male decedents, 32 percent of the estate is left to children; the 
single female proportion is 37 percent. Again taking half of the 
contributions to other relatives as “distant in age” intergenerational 
transfers, another 22 percent can be ascribed to single males and 23 
percent to single females. Using these figures and the Federal Reserve 
simulated bequest distributions, the estimated “distant in age” in- 
tergenerational bequest transfer flow in 1962 was $11.9 billion. Mul- 
tiplying this figure by the 221.6 percent growth in the nominal value 
of total U.S. net worth between 1962 and 1974 gives us an estimate of 
the 1974 bequest transfer flow of $26.4 billion. 

In 1974 the total value of life insurance death benefits equaled 
$8.885 billion. In conjunction with the Menchik numbers, this raises 
the $26.4 billion bequest flow figure to $28.9 billion. 

We also estimate the flow of intergenerational transfers from par- 
ents to children that occur in the form of financial support during 
college. In 1974 college enrollment totaled 8.8 million students (U.S. 
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Department of Commerce 1978, p. 138). In 1976 parental contribu- 
tions to college-enrolled children who were taken as tax deductions 
averaged $1,738 (Froomkin 1977, p. 479). Assuming that non-tax 
dependent college-enrolled children received one-quarter of this level 
of support, average support from parents in 1976 was $1,270. Re- 
ducing this number by 15 percent, the growth in tuition between 1974 
and 1976 (U.S. Department of Commerce 1978, p. 165) suggests a 
level of college support payments of about $1,080 in 1974. This type 
of intergenerational transfer could then account for $10.3 billion of 
the total 1974 flow. Adding the 10.3 to the 28.9 leaves us with $39.2 
billion of explained transfer flows. 

Another component of the intergenerational transfer flow is trans- 
fers made in the form of trusts. While direct data on the value of new 
trusts formed in 1974 are not available, there are fiduciary income tax 
data for the years 1965 and 1970 that permit a rough calculation of 
this transfer flow component (U.S. Internal Revenue Service 1970, p. 
46). Between 1965 and 1970 the number of new trusts established 
each year averaged 35,098. The 1970 income of the 152,398 existing 
trusts in 1970 totaled $7,513 billion. The average 1970 trust had, 
therefore, an income of $9,985. Dividing this value by our 1970 series 
1 interest rate of 0.0787 gives an estimate of the average value of a 
1970 trust of $126,874. Multiplying this figure by 35,098 leads to an 
estimate of $4.44 billion as the value of new trusts established in 1974. 
After multiplying this number by 1.395 to allow for the growth in 
total wealth between 1970 and 1974, our estimate of the 1974 flow of 
intergenerational transfers in the form of trusts is $6.19 billion. This 
figure raises the explained transfer flow to $45.4 billion. 

This $45.4 billion figure seems small compared with the $70 billion 
total flow needed if the stock of transfer wealth is $3.151 trillion and 
the age gap factor is 45. On the other hand, it is not too far from the 
$46 billion total flow needed if r exceeds n by 2 percent and the age 
gap factor is 68. 

Unfortunately, before one can precisely determine the total in- 
tergenerational flow, substantially more information will have to be 
collected about family gift giving and support payments to children 
and grandchildren. These data may prove particularly difficult to 
obtain since we would have to estimate the value of the cedar chest 
passed from grandmother to grandnephew, or the family car given to 
son John as a college graduation present, or the value to son Alex of 
making him a full partner in a lucrative family business. Since the 
distribution of wealth is very highly skewed, such surveys need to be 
aimed at the intergenerational transfer payments of the very wealthy. 
However, the very wealthy may be the least willing to disclose these 
types of transfers because of potential estate and gift tax liabilities. 
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VI. Summary and Conclusions 


The evidence presented in this paper rules out life-cycle hump saving 
as the major determinant of capital accumulation in the U.S. econ- 
omy. Longitudinal age-earnings and age-consumption profiles do not 
exhibit the kinds of shapes needed to generate a large amount of 
life-cycle wealth accumulation. The view of U.S. capital formation as 
arising, in the main, from essentially homogeneous individuals or 
married spouses saving when young for their retirement is factually 
incorrect. 

Intergenerational transfers appear to be the major element deter- 
mining wealth accumulation in the United States. Our best estimates 
of the 1974 stock of transfer wealth after allowing for interspousal 
life-cycle accumulation is approximately $3 trillion. 

While these estimates of the stock of transfer wealth are quite large, 
totally eliminating transfers in the U.S. economy would not necessar- 
ily reduce total U.S. wealth by the full amount of transfer wealth. We 
have demonstrated within the context of a steady-state growth model 
that a $1.00 reduction in the stock of transfer wealth may reduce total 
wealth by less than $1.00 if the steady-state real interest rate exceeds 
the steady-state growth rate. Taking the U.S. historical real interest 
and growth rates as illustrative, eliminating a $3 trillion stock of 
transfer wealth would reduce total U.S. wealth by about $2.1 trillion 
in a steady-state context. This, however, is a partial equilibrium analy- 
sis. Substantially more research must be undertaken before we can 
begin to attach probable numbers to full general equilibrium re- 
sponses to changes in transfers. 

This paper suggests the importance of and need for substantially 
more research and data collection on intergenerational transfers. 
Economic models of savings that stress the homogeneity of agents and 
the importance of the demographic structure should give way to 
models that emphasize the rather massive intergenerational transfers 
in the U.S. economy and the apparent concentration of these trans- 
fers among the very wealthy. 
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This paper examines the controversy concerning the relative desira- 
bility of fixed versus flexible exchange rates by examining whether a 
country can achieve a smaller variance of domestic output around its 
full empioyment path operating under an exchange rate rule or 
under a money supply rule. The paper finds that neither policy will 
always dominate the other. However, it does find that if shocks in 
one market of the economy are large relative to shocks to other 
markets, then one type of rule can be shown to dominate the other. 


I. Introduction 


One of the major concerns of the international trade literature has 
been whether or not a country is better off under a fixed or under a 
flexible exchange rate system. There is no consensus that one type of 
exchange rate scheme will always dominate the other. Rather, the 
optimal exchange rate scheme for a particular country has been 
found to depend upon the degree of openness of the country, the 
degree of international capital and labor mobility, and its rate of 
adjustment to disturbances.’ Another consideration has been whether 
or not the disturbances experienced by the country are more likely to 
be external or internal in origin, although recently the emphasis has 


I am indebted to Francis Ahking, Russell Boyer, Rodney Falvey, Anatoli Kuprianoy, 
Leslie Lipschitz, Michael Salemi, Wilson Schmidt, members of the Workshop on Mon- 
etary Theory at VPI&SU, and two anonymous referees for useful comments. I take 


responsibility for any remaining errors. 
' For a good survey of the conditions under which one exchange rate arrangement 


will dominate the other, see Tower and Willett (1976). 
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shifted to the market in which the disturbance occurs rather than its 
geographical origin? 

This paper examines the controversy concerning the relative de- 
sirability of fixed versus flexible exchange rates from a different point 
of view. The question examined in this paper is whether a country can 
achieve a smaller variance of domestic output around its full employ- 
ment path operating under an exchange rate rule or under a money 
supply rule. The exchange rate rules we consider will be completely 
general, so that a country operating under an exchange rate rule can 
be considered to be engaged in managed floating and, as a conse- 
quence, will have an endogenous domestic money supply. On the 
other hand, when an economy operates under a money supply rule, 
its exchange rate becomes endogenous, so that an economy operating 
under a money rule can be thought of as having perfectly floating 
exchange rates. The money supply rules we consider will also be 
completely general. 

The model which underlies this analysis is an aggregate supply— 
aggregate demand model for an open economy with perfect capital 
mobility and rational expectations. The model is a stochastic and 
dynamic version of that used by Parkin (1977). It also has the same 
basic framework as Dornbusch (1976) but departs from his actual 
model by assuming continuous market clearing and rational expecta- 
tions. The model is presented in Section Il. The reduced-form so- 
lutions for real output under the alternative policy rules are found in 
Section III. In Section IV the variability of output under the two 
alternative policy rules is compared and some implications are pre- 
sented. A summary is presented in Section V. 


II. The Model 


Following Dornbusch (1976) and Parkin (1977) we model a country 
which is small enough in the world capital market that it cannot affect 
the interest rate it faces. Thus, capital mobility will insure the equality 
of foreign and domestic interest rates after adjustment for expected 
exchange rate appreciation or depreciation. We also assume that 
domestic goods are imperfect substitutes for foreign goods, so that 
the domestic price level can differ from the foreign price level by 
more than the current exchange rate. Further, we assume that the 
country is small enough that its demands for imports will not affect the 
price of imports, and we arbitrarily set the price of foreign goods 
equal to unity.’ 


* Two recent articles which focus on disturbances by market origin rather than 
geographical origin are those by Boyer (1978) and Lipschitz (1978). 
* By assuming that domestic output and foreign output are not perfect substitutes, 
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The basic model consists of the following four equations: 


Je OP, — BP.) + ayia + be (aggregate supply curve), (1) 


Ve Olt Pg Pellet 0s(@: =P.) vz (IS curve), (2) 
Me = CoP, + Cy, + Coy + Cz3e¢ + Me (LM curve), (3) 
 =2 + Ey_1 (C141 — e;) (interest rate parity). (4) 


The variables are defined as follows: y, = the level of real output 
(income); m, = the domestic nominal money supply; P; = the price 
level; 2, = one plus the nominal interest rate; e, = the exchange rate 
(the domestic currency price of foreign currency); 7 = one plus the 
world nominal interest rate; and y;,, %, n;, €,, and w, = white-noise 
stochastic disturbance terms. All variables are expressed in log- 
arithms. The operator E,_, denotes the rationally formed expecta- 
tion based upon information available at time ¢ — 1. All stochastic 
error terms are assumed to be white noise and distributed inde- 
pendently of each other. Later in the analysis the operator o?. will be 
used to denote the variance of the random variable x. 

The aggregate supply curve (1) is similar in form to that proposed 
by Lucas (1973) and has been used in a closed economy macro model 
by Sargent and Wallace (1975) and McCallum (1978) and in an open 
economy model by Flood (1979). It incorporates the natural rate 
hypothesis since real output is affected only by unanticipated changes 
in the price level. Following Gray (1976) and Fischer (1977b), we 
justify (1) as follows. Assume that nominal wage contracts for period t 
are negotiated at the end of period t — 1 and equate expected labor 
supply and expected labor demand conditional on the information 
available when the contracts are negotiated. Actual employment in 
period t, however, depends upon the realized wage in the period, and 
for this reason innovations in the price level cause fluctuations in 
employment and output.4 Further, since employment varies inversely 


our model differs from the models of Bilson (1978) and Flood (1979). We will also use 
money rules in our analysis which are more general than the random-walk money 
growth rules which they use. 

* The aggregate supply curve (1) is also obtained by Gordon (1978, chap. 7) using 
assumptions very similar to those of Gray (1976) and Fischer (19776). We have not used 
Lucas’s derivation of (1) as he assumes informationally separated markets. As Flood has 
pointed out, the assumption of informationally separated markets on the supply side is 
not consistent with the contemporaneous trading across countries implicit in eqq. (2) 
through (4). Such contemporaneous trading also implies that the behavioral param- 
eters in the model can be assumed to remain unchanged as the monetary authority 
switches from an exchange rate rule to a money rule or vice versa. As Lucas (1973), 
Barro (1976), and Leiderman (1979) have shown, such an assumption may not be 
correct if agents operate in informationally separated markets and must make infer- 
ences about the price level or the exchange rate based upon transactions prices ob- 
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with the real wage, we restrict a, > 0. The omission of a constant term 
in (1) indicates that y, should be interpreted as the deviation of output 
about the level which would obtain if there had been no past innova- 
tions in the price level. 

The IS curve (2) is that of conventional theory with two modifica- 
tions. The first is that the interest rate variable is the real rather than 
the nominal interest rate. This modification is required by the fact 
that when the price level is allowed to change in the analysis, invest- 
ment demand should depend upon real rather than nominal interest 
rates.” The second is the presence of the relative price of foreign to 
domestic goods. (The logarithm of the foreign price level is omitted 
since the foreign price level was assumed to be unity.) Since higher 
real interest rates should depress the demand for domestic output 
and since an increase in the price of foreign goods relative to domestic 
goods should increase the demand for domestic goods, we expect b, = 
0 and b, = 0. 

That the LM curve (3) is the conventional LM curve can be seen by 
rewriting (3) as 


Mt — CoP, — C3 = Cie + Colt + Ne. (3) 


Thus, (3’) states that the demand for real balances depends upon real 
income and nominal interest rates and that for the money market to 
be in equilibrium the demand for real balances must be equal to the 
supply of real balances. The reason for including the exchange rate in 
the determination of real balances is that individuals in the economy 
may deflate their holdings of nominal money by a combination of 
both domestic and foreign prices since they can purchase both 
domestic and foreign goods.® We expect that cg, c3 = 0 and ¢) + cg = 1. 
Further, the conventional demand-for-money view is that c, => 0 and 
Co = 0. 

Equation (4) results from the assumption of the smallness of the 
country in world capital markets and from the assumption that 
foreign and domestic assets are perfect substitutes for investors. As a 
consequence, capital mobility will force equality of expected nominal 
returns on domestic and foreign bonds expressed in units of the same 
currency. 


In equations (2) and (4) of the model, expectations concerning the 


served in their market. It would be interesting to examine how the conclusions of this 
paper would change if this informational assumption were modified. 

° In general, for a random variable x, log (Ex) > E (log x). Therefore, i, — Ey-,(Pr41 — 
P,) is only approximately equal to the logarithm of one plus the real interest rate. 
Further, (4) is only an approximation to the interest rate parity condition for the same 
reason. However, log (Ex) is equal to the expected value of the first-order Taylor series 
expansion of log x around Ex. I am indebted to a referee for this point. 

* This same assumption is made by Boyer (1978, p. 1047). 
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rates of change of the price level and the exchange rate are assumed 
to be conditioned on information available in period t — 1. On the 
other hand, period t information on the price level, the nominal 
interest rate, and the exchange rate are assumed to be used by agents 
in their output and money demand decisions. This raises the possibil- 
ity that the model may be asymmetric with respect to its information 
assumptions, although such asymmetry could be reasonable if bond 
purchases were assumed to be made at the beginning of period t 
before the output and money demand decisions. Nonetheless, we 
recognize that there is a degree of arbitrariness with regard to the 
assumptions concerning the conditioning of expectations. For this 
reason we treat the case where agents use period ¢ information to 
form expectations in the Appendix. Both assumptions about the 
conditioning of expectations have appeared previously in the litera- 
ture. The papers by Sargent and Wallace (1975), McCallum (1978), 
and Flood (1979) use expectations conditioned on information avail- 
able at time t — 1, whereas expectations conditioned on information 
available at time ¢ are used by Bilson (1978) and by Flood in his 
appendix. 


III. Implications 


We now solve the model given by (1) through (4) first under a money 
policy rule and then under an exchange rate rule. 


Money Rule 


In an economy in which the monetary authority is following a rule for 
the determination of the money supply, the exchange rate is an 
endogenous variable; that is, the country operates under flexible 
exchange rates. When ¢; is endogenous, solving equations (1) through 
(4) for the price level yields 


Pr= k{(e3 + bo¢1)(Q1E¢-1P — G21) 
+ c3b,[2 + EP Crit) 2 belli Py) (5) 
— bocylt + Ey-1(C141 — €2)] + dome + Or} 


where 6; = [—(c3 + €,b2) pe + C3¥4 — bom] and k = (by + ayC3 + C1452)". 
Applying the E,_, operator to both sides of (5) and subtracting the 
result from (5) yields 


Py -— EpsP, = kh(Q + bo€r), (6) 


where €, = m, — E;-1m is the innovation in the money supply in period 
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t. Substituting (6) into (1), we obtain the reduced-form solution for 
real output as 


ye = yh (Oy + dg€¢) + Goyt—-1 + Me 


(7) 
= k[Dopy + ayC3¥_ + ay bo(€ — Ne) + a2 \t-1- 


Given (7), the solution for the real exchange rate (e, — P;) can be 
obtained from (2) and (3) using the Sargent-Wallace forward solution 
technique. We find’ 


Chat e —bjtlb, + {d2/[be + b, (a, — 1) y-1 


+ kp, — (1 + €,03)¥: + a1 (& — 0:)I. 


(8) 


Substituting (7) and (8) into (2) yields the solution for the real interest 
rate as 


Cpe Ea epg ag) ed {2 (de a 1)/[b, a bi(a2 > 1)}} ye-1. (9) 


Further, from (7) we obtain the variance of real output under a 
money rule to be 


Crim = RL — a3) [bon + (aresPor + (ayb2)P(on + a2]. (10) 


Since E,_;m4;, j) = 0, terms do not appear in the solutions (7) 
through (9), our results indicate that changes in the money rule do 
not have impacts on the real variables in the economy. And since this 
is true for the levels of these variables, it must hold for their variances 
as illustrated by (10). On the other hand, equations (7) through (9) 
show that with the exception of the real interest rate, the real variables 
in the economy do respond to unanticipated shocks. These are, of 


* The solution to obtain (8) requires us to first obtain the forward solution to: 


b 1 


TET Ges ht) ye =~ Pe eels ee 
By repeated substitution, the above equation can be written as: 
Las bo Bee 
E,4(@ — P,) = - — ———} (byt — Ext yess 
t-1(6¢ t) Pasa Di bb )¢ it — Evy e435) 


‘i n+1 
ze i —— Eerie = bene) 
2 1 


Thus, (8) requires the assumption that: 


on) E-i1(Gt+n — Prin) > 0. 

The imposition of this terminal condition rules out the possibility of expected changes 
in future relative prices, which would lead individuals to demand infinite quantities of 
either foreign or domestic goods in the present period. This terminal condition is 
similar to that of Sargent and Wallace (1975, p. 248). Eq. (8) also requires that Ey, 74; 
not be too explosive, i.e., that a, not be too large. 


iim = 


no 
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course, the familiar results for aggregate models which incorporate 
rational expectations into a natural rate aggregate supply equation 
such as (1).8 

The solutions (7) through (9) also have interesting implications for 
whether or not the domestic economy is independent of the rest of 
the world under a money rule. From (7) we find that the level of real 
output is independent of the world interest rate. So, from this per- 
spective, our results agree with Parkin’s (1977, p- 238) conclusion that 
the domestic economy is independent of the rest of the world under a 
money rule. However, the time paths of the real exchange rate and 
the real interest rate depend on the world interest rate but not on the 
money rule being followed. Thus, the domestic economy is not inde- 
pendent of the rest of the world in the sense that changes in the world 
interest rate will have real impacts on the domestic economy which 
cannot be offset through the money rule.? 

The reason that a change in the world interest rate will have real 
impacts can best be seen by demonstrating that the opposite cannot 
occur. If changes in the world interest rate were to have no impacts on 
domestic real variables, an increase inz would have to be offset by an 
equal decrease in F;_,¢4, in order to leave the real interest rate and 
the real exchange rate unchanged. However, the decrease in next 
period’s exchange rate implies a decrease in next period’s relative 
price of foreign and domestic goods and next period’s demand for 
output. Consequently, current expectations of next period’s price 
level will decrease. This reduces the demand for current period 
output since the real interest rate will be increased. However, since 
current output cannot change due to the form of (1), equilibrium 
must be restored through increases in the real exchange rate to offset 
the increase in the real interest rate.!® 

In order to illustrate the way in which unanticipated disturbances 


* Explicit solutions for P; and e, could be obtained by solving the system of equations 


b P, _ C3 De ee a a (11) 
: et —Co be = Clt + By (Cry — @:)) + Bem: + €p = He = Cre 


simultaneously using the Sargent and Wallace forward-solution technique. From (11) it 
is obvious that such solutions will include the E,_,m,4;,7 = 0, as independent variables. 
Thus, the nominal variables in this economy will respond to changes in the money 
supply rule being followed. 

® From (11) it is obvious that both and policy parameters will appear in the solutions 
for the domestic price level, the exchange rate, and the nominal interest rate. Thus, the 
expected levels of these variables can be determined by the domestic monetary au- 
thorities regardless of the level of the world interest rate. 

‘© Since we have assumed that the world price level is determinate and fixed, changes 
in 2 can only be caused by changes in the world real interest rate. Thus, it is not 
surprising to find the domestic real interest rate affected by such changes under flexible 
exchange rates. 
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affect the real and nominal variables under flexible exchange rates, 
we examine the effects of an unanticipated increase in the domestic 
nominal money supply. Differentiating (7) through (10) with respect 
to e,, we find that an unanticipated increase in the nominal money 
supply will increase real output, the exchange rate, and the price level 
while decreasing both the nominal and real interest rate. Further, if a, 
> 0, the increase in the price level will be less than the increase in the 
exchange rate so that the real exchange rate will increase. 

These changes occur for the following reasons. The unexpected 
increase in the nominal money supply leaves expectations of current 
and future prices and exchange rates unchanged. However, it does 
cause money supply to be greater than money demand with a corre- 
sponding equal-valued excess demand for both domestic and foreign 
bonds which are perfect substitutes. The excess demand for bonds 
will put upward pressure on the exchange rate as agents attempt to 
sell domestic money for foreign money to buy foreign bonds. The rise 
in the exchange rate increases the demand for output through the IS 
curve. Thus, the effect of the unanticipated increase in the nominal 
money supply is to shift out the aggregate demand curve. There will 
then be an excess demand for output which will cause upward pres- 
sure on the domestic price level, and since the price level increase is 
unanticipated, it will cause the supply of output to increase unless a, = 
0. Obviously, the rise in the price level must be less than the increase 
in the exchange rate otherwise there would be no shift in the aggre- 
gate demand curve. 

Because of the recent discussion concerning whether the exchange 
rate will overshoot or undershoot its final level in response to a 
monetary disturbance,'' we conclude the discussion of the money rule 
case by determining the time path of the exchange rate in response to 
an unanticipated change in the money supply. To facilitate this dis- 
cussion, we assume a specific money rule of the simple autoregressive 
form 


m= PmMmey, + Ey. (12) 


Under such a rule, the time path of the exchange rate, ignoring terms 
involving the world interest rate and errors other than &, is 


er = Alby + are, + > (vip! +arbokyad)esi, (13) 


j=0 


where 


__' The overshooting phenomenon is obtained by Dornbusch (1976, 1978), although 
in the latter paper he finds that undershooting rather than overshooting occurs when 


price levels are allowed to adjust. The overshooting effect has also been found empiri- 
cally by Frenkel (1978). 
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es fae a 
ee (ip) 


and 


Co — €y1b;(ag — 1) + bs | 


Veta? SAC) ee bale ea (ast) 


Differentiating (13) with respect to €,, we obtain the initial effect of 
the monetary shock to be 


Oe: 0s 
de, a k (by a a;) > 0, (14) 


so that the immediate effect of the unanticipated increase in the 
money supply is to increase the exchange rate. The reasons for this 
effect are given in the above discussion of the impact effects of 
unanticipated changes in the money supply. 

The effects of an unanticipated change in the money supply in 
period t on the exchange rate in subsequent time periods are obtained 
by differentiating (13) for period t + 7 with respect to e, to obtain 


0 r+; 


aca yip7 1 + a,boky,at 1,7 > 1. (15) 


From (15) we first find that if both | p | and | dy | are less than unity, 
unanticipated changes in the money supply will decay exponentially. 
We also find that unanticipated changes in the money supply in 
period ¢ will not have impacts past the initial period unless there are 
feedback effects in either the money supply rule or the aggregate 
supply function, that is, unless either p or a, is nonzero. 

However, when feedback effects are present in the money supply 
rule but not in the aggregate supply function (a, = 0 and p > 0), we 
find even though ¢4, will be above the long-run level of the exchange 
rate, we cannot determine whether it will be above or below ¢. Thus, 
in this case we find that even though the exchange rate will always 
initially overshoot its long-run level in response to an unanticipated 
money supply change, the overshooting actually may be larger in the 
period after the shock than in the period of the shock. 

On the other hand, when we relax the assumption that a, = 0 and 
allow 0 <a» < 1, we find that it is possible for the initial overshooting 
of the exchange rate in period ¢ to be followed by undershooting in 
subsequent time periods. This result occurs because it is not possible 
to sign y2. Thus, our analysis indicates that even though an unantici- 
pated money supply increase must initially increase the exchange 
rate, if lagged output terms appear in the aggregate supply function, 
the exchange rate may either undershoot or overshoot its long-run 
level in subsequent time periods. 
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Exchange Rate Rule 


We now consider the case in which the monetary authority in the 
economy follows an exchange rate rule rather than a money supply 
rule. A monetary authority following an exchange rate rule sets a 
target value for the exchange rate (é,) according to some known rule. 
This rule could be of the linear feedback type. Alternatively, the rule 
could be that é = é for allt, yielding the case of fixed exchange rates, 
or that the exchange rate appreciate or depreciate at a given rate over 
time. The important point for the analysis which follows, however, is 
not the particular form of the exchange rate rule. Rather it is that the 
rule used to determine the exchange rate be known and constant over 
time. In this case, the rational expectation of the exchange rate in 
period ¢ + 1 will be Ey, (é4;) = 4). 

Of course, the monetary authority cannot establish é, by fiat. Under 
an exchange rate rule the monetary authority must accomplish its 
objectives by buying or selling foreign or domestic reserves until é, is 
achieved. However, since either action will affect the domestic money 
supply, under an exchange rate rule the domestic money supply 
becomes an endogenous variable. Further, given the structure of the 
economy, a given exchange rate rule will imply the time path of the 
domestic money supply and the monetary authority’s holdings of 
international reserves.'” 

When m, is endogenous, solving (1) through (4) for the price level 
yields 


P,= {b,[¢ ae ESC py ae ey EP aa PI 


1 
+ bey + + Q, Ey, P; — a2 Vt-1 Mr }/(be + a;). ao) 


Applying the £;_, operator to both sides of (16) and subtracting the 
result from (16) yields 


Py — Ey Pe = (4 — bt + b2@4)/(bp + ay), (17) 


where @, = e, — E,_1e, is the innovation in the exchange rate in period 
i.'° Substituting (17) into (1) yields the reduced-form solution for real 


” Certain exchange rate rules might require the monetary authority to sell a quantity 
of international reserves exceeding their initial holdings. Such exchange rate rules 
would obviously not be feasible and are implicitly ruled out of the analysis. However, 
since the variance of real output under an exchange rate rule is independent of the 
actual rule chosen, the infeasibility of some exchange rate rules does not affect our 
results. I am indebted to a referee for this point. 

'S Since the monetary authority is intervening in the foreign exchange market, it is 
difficult to see how such innovations in the exchange rate could occur. Nonetheless, we 
have introduced «; into the analysis of the exchange rate rule case to make it symmetric 
to the money rule case in which the actual money supply could deviate from the desired 
money supply of the monetary authority by e,. However, since @, is a white-noise 
random error which is assumed to be uncorrelated with the other disturbances, the 
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output as 
i (do pL + ayy, + ayb,@,)/(by + dy). (18) 


Using the Sargent-Wallace forward solution technique, the solution 
for the price level can be obtained from (16) as'* 


co 


byw a 1 
P = 1 2 ee ee ee 
eer: by + bia — 1)"  , — by) 2, 
—b, j 
Sane (by) Ep-1eetita + (b2 — bi) E e145] (19) 
by — by 


— (Me — Ve + be@,)/(be + a4). 


The real exchange rate can be obtained by subtracting (19) from e = 
E,_,@; + @;, and the real interest rate can be obtained by substituting 
for 9, and ¢, — Py in (2): 

The reduced-form solution for output (18) once again has the 
familiar neutrality result for models which incorporate rational ex- 
pectations and the natural rate in the aggregate supply equation. 
However, since (19) depends on F;_,e4; terms, under an exchange 
rate rule both the real exchange rate and the real interest rate depend 


inclusion of the @, does not modify the fact that e, is exogenously determined in the 
exchange rate rule case. 
14 The solution to obtain (19) requires us to first obtain the forward solution to: 
b 


a; — (Eri Pst gig cto pe py) cts Piped Cpe 
b, — by 


By repeated substitution, the above equation can be written as: 


1 
Ee oie ine 


n 


Digs ¥ b, e 
~ ; | (hey Ee Corp ete ian Chg 
EviPr >, ( pre b, )| bab; (i t-16¢45+1) t—16t+j 


1 b, n+1 
a bs a b, Exar € ie sos Eu-1Pttnt1- 


Thus, (19) requires that: 


: b ig 
lim — oe) E¢1Ptin— 0, 
which rules out “speculative bubbles” in domestic goods. Eq. (19) also requires that the 


process 


(Gg ie sesse Seer tse FF 
not be too explosive. These conditions are very similar to those required by Sargent and 
Wallace (1975). They differ from those necessary to obtain (8) because in the case of (8) 
we are solving for the real exchange rate and in the case of (19) we are solving for the 
domestic price level. It is also interesting to note that, whereas Sargent and Wallace 
have shown that in a closed economy the price level is indeterminate if the monetary 
authority follows an interest rate rule, we can obtain a determinate result if the 
monetary authority follows an exchange rate rule in an open economy. 


Evi e+ 
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upon the exchange rate rule being followed. The results that nominal 
variables (and the nominal money supply is included in this case) are 
affected by the policy parameters also continues to hold. 

Further, from (18) we find that under an exchange rate rule the 
variance of real output is 


ote = (0307, + alo? + atbsa2)il(be + a)? —"3) (20) 


which once again is independent of the particular exchange rate rule 
being followed. However, we do find that the variance of output 
under an exchange rate rule differs from the variance of output 
under a money rule. One major difference is that under an exchange 
rate rule the variance of output is independent of any disturbances 
which occur in the money demand equation. The reason is the fol- 
lowing. Under an exchange rate rule, the nominal interest rate is 
entirely determined by this rule and the world interest rate. With the 
interest rate thus fixed, real output and the price level are determined 
solely by the aggregate supply and IS curves. The only role of the 
money demand equation is to determine the level of the endogenous 
nominal money supply. 

The solutions (18) and (19) also have interesting implications for 
whether or not the domestic economy is independent of the world 
interest rate under an exchange rate rule. Since the systematic part of 
the exchange rate rule (the E;_,e;4; terms) appears in the solutions for 
both the real exchange rate and the real interest rate, the monetary 
authority can offset any changes in the world interest rate by chang- 
ing its exchange rate rule. Specifically, if the monetary authority 
reduces the expected exchange rate in the next period by the amount 
of any increase in the world interest rate while holding the expected 
current exchange rate constant, the domestic economy will be inde- 
pendent of changes in the world interest rate. Such an exchange rate 
rule would keep both the domestic nominal interest rate and the 
current exchange rate constant. Consequently, the change inz would 
leave aggregate demand in the domestic economy unchanged with 
the result that no real variables would change. 

Thus, we have found that whereas under a money rule the domes- 
tic economy will not be independent of the rest of the world, under a 
managed exchange rate system the domestic economy can be inde- 
pendent of the rest of the world in the sense that changes in the world 
interest rate will have no real effects. The difference in the two rules 
can be explained as follows. If changes in the world interest rate were 
to be neutralized under a money rule, the money supply would have 
to be systematically changed in response to changes in 7. However, 
since (2) through (4) are homogeneous of degree zero in the expected 
current money supply and the expected current and future price level 
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and exchange rate, any change in the expected money supply will be 
accompanied by proportional changes in the price level and exchange 
rate which leave the expected rate of change of the price level and the 
exchange rate unchanged. Consequently, under a money rule an 
increase in 7 will have real effects. Under an exchange rate rule, 
however, Ey_,(é41 — e:) can be modified without necessarily altering 
the current money supply, and an exchange rate rule can be devised 
to make the domestic economy independent of the world interest rate 
as discussed above.'® 


IV. Implications 


Comparison of the variance of output under a money rule and under 
an exchange rate rule indicates that it is not possible to determine that 
one rule always dominates the other. Like the results in the traditional 
analysis of the merits of fixed versus flexible exchange rates, we see 
that which rule is preferred depends upon location and magnitude of 
the variance of the stochastic disturbances in the model. Nonetheless, 
we can determine some special cases in which one type of policy rule 
will yield a lower variance of output than the other. 

We first consider the case in which the variance of the disturbance 
in the money demand function is large relative to the variances of the 
other disturbances. In this case, the variance of output will always be 
smaller if the monetary authority follows an exchange rate rule rather 
than a money rule, since under an exchange rate rule, real output is 
determined independently of the money market. Because fixed ex- 
change rates are a special case of our exchange rate rule, this result is 
similar to Boyer’s (1978) finding that a fixed exchange rate is optimal 
when shocks occur only in the money market and that is the market in 
which the authorities are intervening. Further, it corresponds to the 
results of Turnovsky (1976), Fischer (1977a), and Flood (1979) who 
find that money demand shocks do not affect output (consumption in 
the case of Fischer) under fixed exchange rates. 

Next, we consider the case in which the variance of the disturbance 
to the aggregate supply curve is large relative to the variance of the 
other disturbances. Setting 0? = 0% = a2 = 0% = 0 in (11) and (20), 
we obtain 


15 On the other hand, if the monetary authority follows a fixed exchange rate rule, 
the domestic economy will be affected by changes in the world interest rate since 
changes in the world interest rate will not be offset by exchange rate changes. Thus, 
changes in? will have real effects since the change in the domestic nominal interest rate 
will shift the aggregate demand curve. Further, (19) shows that with fixed exchange 
rates, the price level will be determined solely by the level of the world interest rate and 
random shocks. Thus, a fixed exchange rate policy will yield no inflation on average 
unless there is an upward drift in the level of the world nominal interest rate. 
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Tim! Coie = (a, + by) /[bp + ay(bec, + c3)P. (21) 


Thus, 


Cur es rm as Cb, + Cs =| re (22) 


This result is explained by the fact that the variance of the output 
response to an aggregate supply shock will be larger, the steeper is the 
slope of the aggregate demand curve. Under an exchange rate rule, 
the slope of the aggregate demand curve is 








OH | = - 23 
(apr), =~ ( 
Under a money rule, the slope of the aggregate demand curve is 
0 
( a ), = =De | (Ose3 =f Gs) (24) 


Thus, when c,b, + c3 < 1, the slope of the aggregate demand curve 
will be steeper and the variance of output larger under a money rule. 
However, when c,b, + cz > 1, the aggregate demand curve will have a 
steeper slope and the variance of output will be larger under an 
exchange rate rule. 

These results do not agree with those of Fischer (1977a) and 
Lipschitz (1978) who find that fixed exchange rate rules are always 
preferred when the major sources of disturbances are shocks to out- 
put (supply) which are independent of demand. The differences arise 
because Fischer is concerned with the variance of consumption, which 
is defined to be the difference between nominal income and the 
balance of payments. He obtains his result since, under fixed rates, 
balance-of-payments outflows tend to offset part of the increase in 
consumption caused by the increase in income. In Lipschitz’s model 
changes in the exchange rate cause wealth effects which tend to move 
in the same direction as the effects of the change in income. Thus, 
since fixed rates eliminate the wealth effects, they are variance 
minimizing. 

The third case to consider is that in which the variance of the dis- 
turbance to the IS curve (caused, e.g., by large disturbances to the 
investment demand function) is large relative to the variance of the 
other disturbances. Setting o2 = o2 = o2 = o2 = 0, we obtain 


Trim! iE = c3(ay om bo)? /[ by + Ay(boc, + c3)P. (25) 


Under the presumption that cz, the elasticity of money demand with 
respect to the exchange rate, is small, (25) indicates that a money rule 
will always yield a smaller variance of output against output demand 
disturbances. This result occurs because under an exchange rate rule 
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the nominal interest rate is fixed. Thus, any disturbance to the IS 
curve must be cleared by price-level and output changes. However, 
under a monetary rule part of the market equilibration can occur 
through changes in the interest rate. This lessens the amount of 
price-level change necessary to clear the market. Consequently, the 
amount of output response which must occur as a result of the shock 
to the IS curve will be smaller under a money rule. This is similar to 
Lipschitz’s (1978) finding that if the major sources of disturbances are 
shocks to demand which are independent of real output, then an 
economy should use floating rates. Our result also agrees with Tur- 
novsky’s (1976) finding that, with a high degree of capital mobility, 
flexible exchange rates yield a higher degree of output stability than 
fixed exchange rates. 

Finally, we consider the case in which the variances associated with 
the choice of the exchange rate regime itself are large relative to the 
variances of the other disturbances in the model. Setting oj = 0? = 
oa? = 0 we obtain 


C2 y/ G27 =(02/02,){ (a, + bo)? Mbe + a,(bee, + €3)P}. (26) 


Thus, (26) indicates that a money rule will be preferred to an ex- 
change rate rule if unanticipated changes in the money supply cause 
smaller shifts in the aggregate demand curve than do unanticipated 
changes in the exchange rate and vice versa. This result can be 
obtained by solving the demand side of the model along the lines used 
to obtain the slopes of the aggregate demand curves for the two 
regimes. 


Vv. Summary 


We can summarize our findings in this paper as follows: 

1. We find that we cannot determine that one type of monetary (or 
exchange rate) policy will always dominate the other. However, if we 
are able to determine that the variance of one type of shock to the 
economy is large relative to the other shocks, then the above analysis 
indicates that one type of rule can be shown to dominate the other. 

2. On the other hand, there may be some merit in the discussion of 
the merits of freely fluctuating versus managed exchange rates based 
upon the degree of openness of the economy if the shocks to the 
domestic aggregate supply curve have the largest variance. The dis- 
cussion in Tower and Willett (1976, pp. 44—45) indicates that b., the 
elasticity of the demand for output with respect to the relative price of 
domestic and foreign goods, is larger the more open the economy. 
Thus, the more open the economy, the more likely it is that a money 
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rule (freely fluctuating exchange rates) will yield a smaller variance of 
output in response to shocks to the aggregate supply curve." 

3. Since policy parameters do not appear in the variance of output 
under an exchange rate rule, fixed rates are as good as any other form 
of managed rate in terms of stabilizing output. Thus, arguments for 
managed rates as opposed to fixed rates must be based upon differ- 
ences in price level variability or some other criteria. However, since 
the variance of output will be different under flexible than under 
managed exchange rates, output stability can be used as the criterion 
for deciding between the two types of exchange regimes. 

4, Real variables in the economy can be insulated from changes in 
the world nominal interest rate under a managed exchange rate 
scheme, whereas the real exchange rate and the real interest rate will 
be affected by changes in the world interest rate under flexible ex- 
change rates. 


Appendix 


We now solve the model for the symmetric information case in which infor- 
mation in period ¢ is assumed to be available to agents in making their 
commodity and money demand decisions. Thus, we use the model 


Ye = a,(P;, — E,_,P,) + py (aggregate supply) (Al) 
ve = byl — (EePin1 — Pz)] + bo(e: — Ps) + »% (IS) (A2) 
M = CoPy + Cre + Cole + Cze, + 4 (LM) (A3) 

) =2 + E,e,_, — ; (interest rate parity). (A4) 


In (Al), we assume a, = 0 for simplicity. 
We first consider the money rule case. When e; is endogenous and letting b¥ 
= b, — b, and c¥ = c3 — C2, we solve (Al) through (A4) for P; and obtain 
Py = R¥{ (Gyeg + O¥c,)E, P, + c¥b,[% + Ey(C4. — Pr41)1 
+ bkm, — b¥co(t + Epets1) + OF}, 
where 0% = [—(c¥ + b¥c,)e + c¥#v, — b¥q,] and k¥ = [b¥(1 — co) + ayc# + 
CO. 


Since (A1) must hold for all periods, it must hold for period ¢ + 7, where j > 
0. Thus, 


(A5) 


Jers = O1(Pess — Bets Pees) + Mess. (Al’) 
Applying the operator E; to both sides of (A1’) yields 
Eyes as J > 0. (A6) 


16 Since Flood (1979) assumes that purchasing power parity holds, he is concerned 
with the case in which b, — ©. He finds that var (P; — E;_,P;) is not affected by changes 
in oj under fixed exchange rates, whereas it is affected by changes in o7, under flexible 
exchange rates. Taking the variances of (6) and (17) and letting 6, — © shows that 
Flood’s result also holds for our model. 
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Equation (A2) must also hold for all periods. Applying the E; operator to 
both sides of (A2) for period ¢ + j and using (A4) and (A6) yields 


0 a bv ta Eu (Cetj41 Fr Prijt1)] STS bFE 1 (e4; Saat PN (A2’) 


This first-order difference equation in E;(e4; — P4;) can be solved forward, 
yielding 


Er (ets — Pry) = —b,2/b¥, Os (A7) 


Equation (A3) must also hold for all periods. Applying the E; operator to 
both sides of (A3) for period ¢ + j and using (A4) and (A6) yields 


Ermer; = — CoE t(Ct45 — Pesj) + Cot + CoP re rsj41 


+ (1 — ce) Ecees541. 


(A3") 


Equation (A3’) can be solved forward, and using (A7) we obtain 


00 


Exersj3 = — (C2 + Cob,/by + (1 — cg)" Ss (<=) Eymej+i- — (A8) 
2 


i=0 


After substituting for E;(e4; — P41) using (A7) and E;(e,,) using (A8), we 
apply the E;_,; operator to both sides of (A5) and subtract the result from (A5) 
to obtain 


Py — Ey-yP, = k* (OF + b¥e,). (A9) 
Substituting (AQ) into (A1) yields the solution for real output as 
Yt = k*[bD¥(1 — cy)pe + c#Yy + DF(Er — N)I. (A10) 


And from (A10) we find that the variance of output under a money rule will 
be 


orm = (k*PL(bEP(L — coPon + (c#Pot + (b¥)*(oe + o7n)]. (All) 


Further, if we were to solve for the real exchange rate and the real interest, 
we would once again find that they were independent of the systematic part of 
monetary policy but not independent of the world interest rate. Thus, these 
results do not differ much from those in the text. 

We next consider the case of the exchange rate rule. When m is endogen- 
ous, we solve (Al) through (A4) for P; and obtain 


Py = {ay Ey P¢ — poe + Oylt + Ee (C1 — Pess)] (A12) 
+ vy, + b¥e,}/(a, + bF). 


Following the same procedures used to obtain (AQ), we find that under an 
exchange rate rule 


Py — Eppa Pe = (—Me + + OF @,)/(D¥ + a4). (A13) 
Substituting (A13) into (A1) yields 
Ne = (OF Me + ayy, + a, b¥e,)/(b¥ + a4). (Al4) 


Therefore, the variance of output under an exchange rate rule is 
Cine = ((bF Pou + ajay + (a,b¥ Pool (b¥ + a). (A15) 


If we were to solve for the real exchange rate and the real interest rate, we 
would find they have the same properties with respect to systematic changes 
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in the exchange rate rule and changes in the world interest rate as do the 
solutions in the text. 

Further, when expectations are based upon information in period ¢, the 
slopes of the aggregate demand curve under a money rule and an exchange 
rate rule are 


( Oy: = —b¥(1 — c2)/(c¥ + €,5F) a) 
OP, M 
and 
( dy Shy (A17) 
OP, E 


respectively. Thus, the cases in which one type of rule will dominate the other 
are unchanged from the text. 
The only result which changes due to the change of timing of expectations 
is the time path of the response of the exchange rate to an unanticipated 
increase in the money supply. And letting the aggregate supply curve be (1) 
and the money rule be (12), it can be shown that the only difference in the 
effects of unanticipated changes in the money supply on the exchange rate 
due to the change in the timing of expectations will be in the de;/de, term. 
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Point Spreads versus Odds 





Gilbert W. Bassett, Jr. 


University of Illinois at Chicago Circle 


Why do bookies use the point-spread device for wagering on football 
games when odds betting on the winner is also feasible? One possi- 
bility is that point-spread betting is more fun than odds betting. The 
existence of point spreads, however, does not logically entail this 
“more fun” explanation. Given simple specifications of beliefs and 
betting behavior, and given that point spreads and odds are equally 
thrilling, it is demonstrated that the point-spread game can exist as 
the game produced by a profit-maximizing bookie. The disparate 
beliefs which create the demand for some form of betting market 
can also explain the particular betting game produced by the bookie. 


I 


The latest point spread on the Bears-Rams football game this 
weekend lists the Bears as nine-point underdogs. For a 10-cent fee a 
bookie will accept an even-odds one-dollar wager on the event B + 9 
> R, where B and R are the Bears’ and Rams’ final point totals. Based 
on your knowledge of the teams’ relative strengths, you assign a 
(personalistic) probability of .7 to the event B + 9 > R (and a proba- 
bility of zero to B + 9 = R). Hence, a one-dollar (plus the 10-cent 
bookie fee) wager on the Bears has an expected monetary value 
(EMV) of .7(1 — .1) — .3(1 + .1), or a positive 30 cents. Given the 


An earlier version of this paper was presented at the Econometric Society Meeting, 
New York City, December 1977. I want to acknowledge helpful comments from a 
referee at this Journal. 
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alternative of not betting, you as an EMV maximizer will wager that 
the Bears will beat the spread. 

Point-spread schemes characterize virtually all of the wagering in 
football betting markets. So-called odds betting, however, is also pos- 
sible and can provide for any particular individual an EMV identical 
to the point-spread game. If the odds are 6: 1 ona Bear victory (B > 
R) and if you assign a probability of .2 to this event (and a probability 
of zero to B = R), then the EMV of a one-dollar (plus the 10-cent 
bookie fee) Bears bet is .2(6 — .1) — .8(1 + .1), which is again 30 cents. 
Ceteris paribus, you are indifferent between the point-spread and 
odds wagers. Hence the question, Why do bookies produce the 
point-spread betting game? 

The existence of point spreads is commonly attributed to bettor 
preferences. This explanation is implicit in news articles on point 
spreads, and it is also the most frequently offered explanation in the 
very informal settings where I have posed the point spreads versus 
odds question. Under this view bettor preferences are thought to 
depend on more than EMV or expected utility—utility being defined 
on income or wealth. Point spreads are said to be more fun or thrilling 
than odds so that bettors are willing to forgo expected return with 
odds in order to play the point-spread game. This appeal of point 
spreads could reflect a general preference for even-odds wagers, or it 
could be simply that point spreads are more complementary with 
viewing a football game. But the bias in favor of even-odds wagers 
does not extend to other lotteries, and many football bettors presum- 
ably wager on games they do not watch. It would appear that the 
majority (in the informal settings) think the intrinsic delight in beating 
the spread is obvious and requires no deeper analysis. The answer to 
the point spreads versus odds question, then, is simply that point 
spreads are more fun than the odds betting game. 

Bettor preferences provide an answer to the point spreads versus 
odds question, but the explanation posits rather curious tastes which 
cannot be understood in terms of an EMV or expected utility decision 
criterion. It is possible, however, that preferences are in fact consis- 
tent with some more traditional decision criterion, and the stated 
views merely reflect the difficulty of imagining football betting with- 
out the point-spread device. Alternatively, the stated views might 
derive from the belief that bettor preferences are the only logical 
explanation consistent with the existence of point spreads. Without 
denying the possible empirical validity of the bettor-preferences ex- 
planation, the intent here is to generate alternative hypotheses for 
point spreads in a more traditional setting where bettors do not 
consider point spreads more fun. Disparate beliefs regarding the 
random variable X = R — B are one reason for the existence of some 
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betting market, and they are examined here as also providing an 
explanation for the particular betting game produced by the bookie. 

The market of potential bettors will be assumed fixed, and each 
bettor will be described by a probability distribution representing 
beliefs regarding X. In deciding on wagers the bettors are assumed to 
use an EMV criterion. The thrill of betting is therefore supposed 
constant over alternative betting games, and risk preference or aver- 
sion is ignored. (Bettors might be imagined to risk only a small frac- 
tion of their income or wealth.) Betting is facilitated by a profit- 
maximizing bookie who is paid a fee by participating bettors in 
exchange for either a point-spread, an odds, or a combined-betting 
game. To insulate himself from the outcome of X the bookie will also 
establish equilibrium parameters for the betting games. (The betting 
market is of some independent interest because it contains an agent, 
the bookie, who has an incentive to establish these equilibrium values.) 
The betting market is therefore envisioned as arising for purely 
speculative reasons. The market exists for the sole purpose of ex- 
pediting the exchange of wagers whose EMV’ are positive rather than 
because of variations in initial allocations of physical-location-dated 
commodities or the possibility of utility-increasing risk transfers. Wa- 
gering is rational in that each bettor is made better off according to his 
EMV criterion even though it is certain that the aggregate of bettors 
will lose money to the bookie. 

The bookie’s profit-maximizing objective translates in the games 
described below into the objective of maximizing the betting pool— 
the total dollars wagered on the football teams. Given the assumptions 
that no wagering scheme is intrinsically more fun than any other and 
that all individuals wager rationally according to an EMV criterion, 
and given the example above which illustrates that for any individual 
there is some odds game in which betting behavior is identical to the 
point-spread game, it might seem that the bookie would be indiffer- 
ent as to which game to produce. In the model, point spreads would 
then exist merely as the outcome of the bookie’s random choice from 
a set of equivalent betting games. If a deterministic rationale for point 
spreads is sought, the model would then have to be abandoned in 
favor of the bettor-preference explanation. We will see, however, that 
depending on beliefs, certain betting games generate greater profits 
for the bookie so that an appeal to the supposed delights of point 
spreads is not logically necessary in order to understand the bookie’s 
production decision. 

The rationale for assuming that individual beliefs are consistent 
with a probability distribution comes from the personalistic or subjec- 
tive theory of probability, and Winkler (1971) has found some empiri- 
cal support for the theory in the football context. In the economics 
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literature, I know of only two references dealing directly with football 
betting.’ Pankoff (1968) used football betting to analyze issues relat- 
ing to the efficient-market hypothesis. Smith (1971) examined betting 
markets because of the equilibrating role played by the bookie and 
because football wagering could provide an unusual framework for 
examining decision making under uncertainty. He used the 
expected-utility hypothesis to analyze how betting behavior under 
various betting schemes might be influenced by wealth, beliefs, and 
the pleasures of gambling. While Smith did consider behavior under 
point spreads and odds games and his model is similar to the one 
presented here, he did not examine the conditions under which one 
betting game is produced over another. 

No attempt is made here to associate differing opinions with differ- 
ent or incomplete information among potential bettors. In fact, po- 
tential bettors might be envisioned as having virtually identical infor- 
mation on the football teams’ current situation and past history. Still, 
bettors are not presumed to agree on the distribution of X. While my 
casual observations suggest that this is a reasonable working hypothe- 
sis, it does conflict with the spirit of rational expectations where 
differing opinions are associated with and analyzed in terms of some 
more fundamental differences among individuals. Besides casual 
empiricism, the fact that the probability assigned to a future event like 
the winner of a football game cannot be as easily inferred as the 
probability assignments in, say, roulette might also serve to justify the 
existence and persistence of differing opinions. Despite being able to 
continuously update their theories by observing the outcomes of 
football games, potential bettors do not reach agreement. Theories 
are sufficiently vague or depend on so many exogenous factors that, 
while each bettor possesses a well-defined distribution for X, dif- 
ference of opinion exists and persists. Difference of opinion is what 
makes for a horse race and is what is examined here as an explanation 
for the point-spreads versus odds question. 

The plan of the rest of the paper is as follows. Section II describes 
the bookie’s objectives and production possibilities, that is, the games 
which can be offered to bettors. A simple parameterization of market 
beliefs and a betting criterion are presented in Section III. The 
assumptions are adopted in order to illustrate how point spreads 
might beat odds and also to determine in a simple model the condi- 
tions for the optimality of a betting game. Section IV begins the 
exercise by working through a special case. Propositions in Section V 
express equilibrium values for the betting games and bookie profits as 
a function of the parameters describing market beliefs. Section VI 


‘I am indebted to Stephen Salant for making me aware of these references. 
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then presents conditions under which point spreads beat odds. The 
assumptions on beliefs and betting criteria are relaxed in Section VII 
and another example is presented. The final section contains some 
concluding remarks. 


II 


The bookie brings together individuals with disparate beliefs regard- 
ing X by producing a betting game called spread-odds (s,z). Depend- 
ing on the bookie’s choice of the spread parameter s and the odds 
parameter z, the spread-odds (s,z) game is either the point-spread 
game, the odds game, or a game combining features of both point 
spreads and odds. The spread-odds (s,z) device, apparently first used 
by Smith (1971), is convenient for stating results and for obtaining 
conditions under which point spreads beat odds and any other 
combined-betting game. 

The bookie collects a fee or commission c for each bet purchased in 
the spread-odds (s,z) game. The commission is assumed fixed so that s 
and z are the bookie’s only choice variables.2, Furthermore, the 
bookie’s administrative and other costs per bet are assumed to not 
depend on the particular spread-odds (s,z) game which is produced. 

Bettors are offered a Rams bet or a Bears bet in the spread-odds 
(s,z) game (bettors will also have the option of not betting). The bets 
are a function of (s,z) where —~© <s < © and 0 <z < 1. Each bet is 
described by its net gain, its net loss, and the event under which the 
gain is realized. The spread-odds (s,z) bets are: Rams betty (2e—es 24 
2) ¢)X 2s]; Bearsibet; [2(1s2) =ent2z ee <s]. An individual 
who purchases one Rams bet will pay the bookie 2(1 — z) + ¢ dollars 
and win two dollars gross, 2z — c dollars net, ifX >s.1fX <s the Rams 
bettor loses his initial payment of 2(1 — z) + ¢ dollars.* Similarly, a 
Bears bettor wins 2(1 — z) — c dollars per bet if X <s and loses 2z + ¢ 
dollars* perm betaia. = s:, 


? In practice the bookie’s commission is also a choice variable whose equilibrium value 
is determined by the betting industry’s market structure. For an interesting study in a 
slightly different context of how the choice of ¢ can influence (a race track’s) profit, see 
Gruen (1976). To partially justify the c fixed assumption used here, one might suppose 
a oe he betting industry in long-run equilibrium with the average cost per wager 
equal to c. 

31f X =s all bets are off. This differs from some betting games in which the bookie 
wins all ties. 

4 The odds parameter z is a transformation of what is usually referred to as “the 
odds.” E.g., if Z = 2/3 and c = 1/99, then the Rams bet is established at (131/99: 67/99) 
and the Bears bet at (65/99: 133/99). In the usual parlance this represents approxi- 
mately 1.96 to 1 odds and 1 to 2.05 odds, respectively, the slight asymmetry in the odds 
being the result of the fee c. In order that the net gains and losses in the spread-odds 
(s,z) game be positive, the domain of z must be slightly restricted to the interval (c/2, 1 — 
c/2). 


POINT SPREADS VERSUS ODDS 757 


The bookie’s choice of a spread-odds (s,z) game is restricted to the 
set of (s,z) which makes profit independent of the outcome of the 
football game. This domain for (s,z) is called the equilibrium set and is 
given by V = [(s,z) | Ap(s,z:¢) = Ag(s,z:c)], where Ag and A, denote 
the number of Rams and Bears bets purchased at the spread s, the 
odds z, given the commission c. 

To see that (s,z) in V makes profit independent of X, consider the 
betting pool (the total dollars wagered on the Rams and Bears), [2(1 — 
z) + c]Ag + [2z + c]Ag. For (s,z) in V the pool is (1 + c)A where A = Ap 
+ Ag. From the pool the bookie returns 2Az dollars to Rams bettors if 
X > s, and at (s,z) in V this is identical to the 2A, dollars returned to 
Bears bettors if X < s. The bookie’s profit is therefore cA(s,z:c) with 
certainty for each (s,z) in V. 

A point-spread game is spread-odds (s, 4); that is, z is fixed at 4. 
With this restriction, wagers are made at (neglecting c) even odds on X 
> s or X <s. The equilibrium point spread is 5 where (5, 4) is in V, 
spread-odds (s, ¥2) is the point-spread game which the bookie may 
produce, and the profit from producing this game is cA(s, %4:c). 

An odds-betting game is spread-odds (0, z) so that bets are made on 
the events X > 0 or X < 0, Rams win or Bears win. The equilibrium 
odds is denoted by z where (0,2) is in V, spread-odds (0,2) is the odds 
game which the bookie might produce, and the profit from this game 
is CAO 26)2 

The profit-maximizing bookie who is permitted to produce any 
spread-odds (s,z) game will select an (s*,z*) in V so that A(s*,z*:c) is a 
maximum, and spread-odds (s*,z*) is called an optimal betting game. 
If A is constant over all (s,z) in V then every betting game is equivalent 
and optimal, and the bookie’s profit is identical no matter which game 
is produced. This situation corresponds to the initial hypothesis 
under which the bookie is indifferent between alternative betting 
games. If A(s*,z*:c) > A(s,z:c) for some (s,z) in V, then some betting 
games will not be produced because they result in lower profits than 
an optimal game. Finally, when A reaches a maximum for a unique 
(s,z) in V, there is a unique optimal game, and profit maximizing by 
the bookie completely determines the spread-odds (s,z) game which is 
produced. 

The bookie who is constrained to either point spreads or odds will 
produce the former when A(s, %:c) > A(0,z:c). When this holds we 
will say that point spreads beat odds. Notice that point spreads, even 
if it beats odds, need not be an optimal game. Finally, an extreme case 
to be considered in Section VII occurs when point spreads beat odds 


° In order that the point-spread and odds games are not identical, it is supposed that 5 
is not equal to zero. 
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and A(0,z:c) = 0; that is, no one purchases the odds wager. In this 
case we will say that point spreads versus odds is no contest. 


Remarks 


1. In practice a bookie who fails to select an (s,z) in V will try to 
exchange some of his wagers with another bookie or will begin to alter 
the spread and odds parameters. Hence, all bets may not be pur- 
chased at the same (s,z). In the model here, however, all bets are 
assumed to be made at an (s,z) in V. One might suppose that the 
bookie’s skill is his ability to read the market and know the (s,z) such 
that-A, = A: 

2. The equilibrium set corresponds to one possible measure of 
consensus regarding the distribution of X. If m(s) = z for (s,z) in V, 
then 7 isa market probability distribution in the sense that the fraction 
of the betting pool (excluding the bookie fee) wagered on the event X 
<s is z, and the fraction wagered on X >s is 1 — z. Bookies who fail to 
select (s,z) in V become bettors themselves and effectively are betting 
that their opinion is more accurate than this market consensus. 
Natural selection probably eliminates these bookies quickly either 
because they have become wealthy and retire or, more likely, because 
they have become financially or physically bankrupt, the latter bank- 
ruptcy being a likely consequence of not paying winning bettors. 

3. In the standard point-spread game the bookie’s commission 
(called the vigorish) results in spread bets being offered at 10:11 
odds. With the spread-odds (s,z) notation this means that the point- 
spread bet is offered at odds of 20/21:22/21 and c = 1/21. 


Ill 


The set of potential bettors is denoted by N and remains fixed 
throughout. Associated with each individual 7 in N is a probability 
distribution F(x) and a betting function; F(x) is the ith bettor’s proba- 
bility assignment to the event X < x, and the betting function along 
with F(x) determines the number of Rams or Bears bets purchased by 
bettor 7. 

Aside from the example in Section VII, beliefs will be assumed 
identical except for a location parameter. With this simple specifica- 
tion F(x) = G(x — y;), and G is further taken to be a symmetric (about 
zero, G(x) = 1 — G(— x)) distribution with continuous derivative g > 
0. Given G, the ith bettor’s beliefs are completely determined by yi, 
the bettor’s opinion regarding the median-mathematical expectation 
of X. The distribution of the y; over the set of potential bettors is 
approximated with N large by the continuous and increasing dis- 
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tribution M (y — ,)—the proportion of potential bettors who believe 
the median-mathematical expectation of X is less than y. The distribu- 
tion M is further assumed symmetric about zero so that btm is the 
median or average of all bettors’ beliefs regarding the median- 
mathematical expectation of X. Market beliefs under this simple rep- 
resentation are therefore a location family of distributions depending 
on G, M, and py. 

A potential bettor will participate in the betting market by pur- 
chasing a positive quantity of the Rams or Bears bet if and only if the 
EMV of the corresponding bet is positive. With this requirement the 
set of potential bettors may be partitioned as follows. 

Given a spread-odds (s,z) game the EMV of one Rams bet for bettor 
2 is 2z — c — 2G(s — wi). The requirement that this be positive can be 
expressed directly in terms of pi; as w; > s — Q(z — c/2), where Q(p) is 
the (increasing) inverse or pth quantile of G. Making similar calcula- 
tions for the Bears bet gives the partitioning, Rams bettors = [i in 
N| mi > s — Q(z — c/2)], Bears bettors = [i in N| mi <5 Oe 6/2) ); 
and the nonbettors are those individuals whose 4; is the interval [s — 
Q(z + c/2),s — Q@ — c/2)]. This holds for arbitrary, not necessarily 
equilibrium, values for s and z. To determine V requires further 
assumptions which describe the quantity of wagers purchased when 
the EMV is positive. 

Let A,“ and A,2 denote the number of Rams and Bears bets pur- 
chased by the 7th individual. The betting rule assumed here is A;* = 
Alpi — 5 + Q(z —c/2)], and A? = Als — pw, — Q(z + ¢/2)], where \(w) isa 
positive and nondecreasing function for w > 0 and is zero otherwise. 
This rule is consistent with the positive EMV requirement. If A(w) = 1 
for w > 0, bettors are restricted to a single bet when its EMV is 
positive. If A(w) is increasing for w > 0 the quantity of bets purchased 
is based not on EMV directly but on the difference between pu; and s — 
Q(z — c/2) or s — Q(z + c/2), depending on which wager’s EMV is 
positive. If uw, > ub, > s — Qi — c/2) the EMV of one Rams bet is 
greater for bettor | and this bettor purchases more Rams bets than 
bettor 2, A,” > d,* > 0. With this betting rule we will have A," = d,? > 0 
(i and 7 have equal but opposite opinions) when y; and yu; are equidis- 
tant but on opposite sides of the interval which describes the nonbet- 
tors. 


Remarks 


1. The assumption of a large number of individuals with well- 
defined distributions defined on X = R — B is an empirical proposi- 
tion which is probably not unrealistic (see Winkler 1971). If X repre- 
sents an arbitrary future event, however, the assumption need not be 
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satisfied, and the model will not be applicable.® For example, if indi- 
viduals do not possess a subjective distribution on the electoral (or 
popular) vote point spread, then the model cannot explain why odds 
beats point spreads in wagering on the outcome of a presidential 
election. 

2. Under the football game interpretation, each potential bettor 
knows that X = R — B is an integer-valued random variable. There- 
fore, there is the assumption that the outcomes of R — B are 
sufficiently numerous that F;(x) may be approximated by a continu- 
ous distribution. In other cases this may be far from adequate, for 
example, baseball and soccer games. In the limit where the winner 
scores one and the loser zero, no point-spreads versus odds question 
will arise. 

3. The location family of distributions is a very restrictive assump- 
tion which undoubtedly does not allow for the full diversity of actual 
beliefs. The assumption is adopted because it permits easy calculation 
of optimal game conditions and thereby will illustrate, albeit in a 
restrictive setting, how one betting game can be favored over another. 

4. The assumed betting criterion is also adopted because it is con- 
venient. With a location family of market beliefs, neither the equilib- 
rium set nor the optimal game conditions will be found to depend on 
d. But the criterion might also be fairly realistic since bettors often 
rationalize their wagers by appealing to the difference between the 
quoted spread and the spread at which they would decide not to 
wager. 


IV 


Before examining the conditions for an optimal game given an arbi- 
trary location family of market beliefs, it is instructive to consider the 
following special case. 

Suppose G is a uniform distribution on the interval (— 10, 10); G(x) 
= .5 + .05x. Letc = .1, wy = 3, and let M be an arbitrary distribution 
satisfying the previous assumptions. 

A Rams bettor is anyone who finds the EMV of one Rams bet to be 
positive or, on making the calculations, uw; > s — 20z + 11. Under the 
assumed betting criterion such a bettor purchases A,” = A(u; — T — 4) 
Rams bets, where we let T denote s — 20z + 7. Summing (integrating) 
over all such purchases gives 


* The assumption can fail in either of two ways. Beliefs might be so vague that 
individuals feel completely incapable of assigning probabilities to future events. Alter- 
natively, beliefs might be inconsistent with a probability distribution, e.g., if one as- 
signed probabilities of .3 and .8 to the respective events X < 0 and X > 1. 
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A® 


n ‘jee Nia Sea Rey 


leer AGH Th = dM Ww), 


where n is the number of potential bettors. Making similar calcula- 
tions gives 


AB = nfo, M(—w + T — 1)dMw). 


This expresses the total quantity of Rams and Bears bets purchased 
given an arbitrary spread-odds (s,z) game. 
To solve for the (s,z) in V use the symmetry of M and obtain 


Aiea ARe= nf Aw — T — 1)dM(w) 


w>T+1 
= aes Mw + T — 1)dM(w). 


At T = 0 this is zero and, since M is increasing, A* — A? = 0 only if 
T = 0. Hence, T = 0 or s(z) = 20z — 7 gives the equilibrium set; at any 
[s(z), z] the bookie’s profit will not depend on the realization of X. The 
equilibrium point spread would be s(¥2) = 3, and the equilibrium odds 
is a solution to s(z) = 0 or z = .35. 

Substituting s(z) for s in the expressions for A*® and A? gives the 
representation of profits for (s,z) in V as 


c(Ap + Ag) = gen [ A(w — 1)dM(w), 


a term which does not depend on (5,z). So why does the bookie 
produce one game, say point spreads, over another? No (determinis- 
tic) answer is to be found in this example. The bookie’s profit is 
identical in all spread-odds (s,z) games, every game is optimal, and 
disparate beliefs as represented here do not offer an explanation for 
the production of a particular betting game. In a world where the 
assumptions of this example are satisfied, the existence of point- 
spread betting would merely represent the bookie’s random choice 
from the set of all spread-odds (s,z) games. 

To put this result differently, consider the point-spread game with s 
= 3. Instead of computing the equilibrium odds, mimic the example 
in the introduction and ask each bettor to compute the odds game at 
which his behavior would be identical to behavior under point 
spreads. For example, a Rams bettor with point spreads will purchase 
d(v;) bets where p; — 4 = v; > 0. An odds game in which the same 
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number of Rams bets would be purchased is found by solving mw; — 11 
— 20z = v;, orz = .35. Every Rams and, with similar calculations, every 
Bears bettor will obtain this same solution, and at z = .35 every 
nonbettor under point spreads remains a nonbettor under odds. The 
behavior of each individual is therefore invariant under the trans- 
formation from (s,z) = (3, %) to (0, .35). Since (3, ¥%) is in V and 
because no behavior is altered at (0, .35), it follows that .35 is the 
equilibrium odds. The point-spread and odds games are therefore 
equivalent. 

In this example any individual can find an equalizing odds game, 
and this is generally true as long as bettors use an EMV -based decision 
criterion. What makes this example special and misleading is that 
every individual arrives at the same equalizing odds of z = .35. When 
G is not uniform we will see that not all bettors compute the same 
equalizing odds, and the invariance of betting behavior as (s,z) varies 
over V will no longer hold. 


Vv 


For the spread-odds (s,z) game with arbitrary (s,z), the EMV-based 
betting criterion provides a straightforward characterization of A® 
and A® asa function of G, M, and wy. The equilibrium set can then be 
obtained by solving for the (s,z) which satisfy the equilibrium condi- 
tion. The betting pool and bookie profits for the spread-odds (s,z) 
game with (s,z) in V then follows directly. 
Proposition 1: For arbitrary (not necessarily equilibrium) (s,z), 
Ar = rll A(w — T — A)dM(w), 


w>T+A 


AB = ‘on A\(-w + T — A)dM (w), 


where T = s — wy — [Q(z + c/2) + Q(z — c/2)]/2 and A = [OG + c/2)— 
Oe c) 2) 2s 

Proof: ‘This follows from the assumed betting criterion, the repre- 
sentation of market beliefs, and the direct substitution w = 1 oe Dr 

Proposition 2: The equilibrium set is V = {(s,z) |s(z) = pit (Os 
Cl 2)eteO Ga c/2)\/2}. 

Proof: By the symmetry of M we have 


A® — A? = nf hw — T — Aya ~ nf Aw T = A)dM. 
w>A+T w>A-T 


This expression is zero at T = 0 and, using the assumption that M is 
increasing, it may be verified that A® — A? < 0 for T > 0 and A® — A® 
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> 0 fort < 0. Hence A® — A® = 0 if and only if T = 0, and this is the 
proposition. 
Proposition 3: Bookie profits for [s(z), z] in V are 


c(A* + A*) = enf ~ A(w — A)dM(w). 


An optimal game is therefore an [s*(z*), z*] such that A(z*:c) = min 
A(z:c). i 

Proof: The first part of the proposition follows on substituting s(z) 
for s in proposition | and then using the symmetry of M. The second 
part follows on noting that with M increasing the expression for 
profits increases as A decreases. 

Substituting s(z) for s in the previous expression for nonbettors, we 
find that as (s,z) varies over V, the nonbettors are those individuals 
whose 4; fall in the interval [wy — A(z:c), wy + A(z:c)]. Since the pw; are 
symmetric about #4, the proportion of potential bettors who purchase 
no bets is 1 — 2M[A(z:c)]. Since M is increasing this proportion is 
minimized at min A(z:c), which is the same as the maximum-profit 
condition. The objectives of maximizing profits and maximizing the 
participation of bettors in the market are therefore equivalent, given 
our assumptions on beliefs and betting behavior. 

When A (z:c) is a constant function of z, all betting games will yield 
equal profits and an equal number of bettors. This occurs when Q is 
linear (for 0 < p < 1), and this in turn implies that G is uniform, the 
example of the previous section. Conversely, when G is not uniform A 
is not constant, and some betting games will not be produced because 
they generate lower profits. The bookie fee in effect drives a wedge, 
consisting of the nonbettors, between Rams and Bears bettors, and 
the size of this wedge depends on which spread-odds (s,z) game is 
produced. 

Finally, consider the point-spread game with the equilibrium 
spread py. If Rams bettors compute an odds game at which their 
behavior is identical with the point-spread game, they will find zz = 
GAO Clee) / 2) Te + c/2. The Bears bettors will compute zg = 
Gro Es) 2) iu} — c/2. If G is not uniform the equilibrium odds 
are not equal to zz or zz so that the odds game withz = zz or zg is nota 
feasible game for the bookie. The existence of equalizing odds for any 
given bettor does not therefore imply that all betting games are 
equivalent. The point-spreads versus odds question depends on bet- 
ting behavior not at equalizing odds but at the equilibrium odds, and 
unless G is uniform these two odds will not be equal. 

Given the assumed beliefs and betting criterion, proposition 3 gives 
the optimality condition for spread-odds (s,z); namely, the bookie 
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produces an [s(z), z] game such that A(z:c) is a minimum. The next 
section translates this condition into simple conditions on G. 


Remark 


Let the (median) consensus probability of the event X < s be denoted 
by 77,(s) where half of the individuals in N believe P (X <s) <7y(s) and 
half believe P(X < s) > 7s). Suppose further that this consensus is 
correct: If 74s) = z for a large number of football games, then z is the 
frequency of X <s. It may be verified that for the present model 77s) 
= G(s — py). Furthermore, if c = 0 everyone in N purchases a wager 
(the set of nonbettors has probability zero), and 7y is revealed by the 
equilibrium set since V = [(s,z) | my(s) = z]. However, from proposition 
2 we see that withc > 0 there will exist discrepancies between 7(s) = z, 
(s,z) in V and the true measure of the market’s opinion 7s). This 
discrepancy arises because with c > 0 some potential bettors do not 
wager, and their opinions are not fully represented by the equilibrium 
set. For example, if c = .2, G is a mean zero, variance 49 Gaussian 
distribution and wy = 7, then one finds the equilibrium odds is z = 
.179, and this is a biased estimate of the potential market’s belief 
regarding X < 0 since 7(0) = .159. Hence z will not reflect the 
frequency of X < 0 even if the consensus beliefs of potential bettors 
are correct. Such discrepancies might be a reason for the observed 
difference between the probabilities implied by the odds in horse race 
betting and the frequency with which horses win (see Ali [1977, p. 
810] for a similar result; see also Seligman [1975]). 


VI 


Suppose in addition to previous assumptions that G is strictly uni- 
modal; that is, g(0) > g(x) for allx and g(x) increasing (decreasing) asx 
is negative (positive). With this assumption jy; is the expectation, me- 
dian, and mode or most likely outcome of X. In such a world we have: 

Theorem 1: If G is strictly unimodal, then point spreads is the 
unique optimal betting game. 

Proof: ‘The first-order condition for the minimum of 4 is q(z + Yc) 
= 92 —c/2) "0,08 g|0@ +"c/2)| 9 — p(O@ — e/2)) 1 Os With G 
symmetric and strictly unimodal this holds if and only if Q(z + c/2) = 
—Q(z — ¢c/2) = Q(1 —z + ¢/2), orz = ¥%. One may now verify that A is 
decreasing for z < 4% and increasing for z > % so that atz = 4, A is at 
its unique minimum value. By proposition 3, z = ¥, or the point- 
spread game is the unique optimal betting game. 

To obtain simple necessary and sufficient conditions which 
characterize the optimal game, suppose c is small and use a linear 
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approximation for Q(z + c/2). Expanding around c = 0 and suppress- 
ing quadratic and higher-order terms gives Q(z + c/2) = Q(z) 
+ q(z)c/2. Up to this order of approximation s(z) = wy + Q(z), z= 
G(—um) = 1 — Guy), and A(z:c) = g(z) = glQ(z)I". 

Theorem 2: For c small, spread-odds [s(z*),z*] is an optimal game if 
and only if Q(z*) is a global mode of G(x).7 

Proof: Minimizing A(z:c) with c small is approximately the same as 
minimizing q(z) = g[Q(z)]"!, and by definition this minimum is 
achieved at a mode of G(x). (Notice that if Q[z*] is a mode, then by 
symmetry so is —Q[z*] = Q[1 — z*], so that [s(1 — z*), 1 — z*] is also an 
optimal game.) 

Theorem 3: Forc small, point spreads beat odds if and only if g(0) > 
&(Mm). 

Proof: With c small, A(Z : c) = g(u)7! (since gl uy] = gl—py]) and 
A(':c) = g(0)' so that the theorem follows from proposition 3. 

It should be emphasized that the simple optimality conditions in 
this section are a consequence of the simple specification of beliefs 
and betting behavior. Under less restrictive assumptions there will 
undoubtedly exist different configurations of beliefs, betting behav- 
ior, and preferences which are revealed by the bookie’s production of 
point spreads. 

The conditions illustrate how market beliefs and betting behavior 
can be mapped via the bookie into a particular betting game even 
when betting games are considered equally thrilling. For example, 
under the assumptions of theorem 2, the existence of point-spread 
betting does not reveal bettor preferences but instead reveals an 
aspect of a bettor’s beliefs, namely, that 0 = Q(1%) is the mode of G. 
There is not a one-to-one correspondence between point spreads and 
the previously described bettor-preferences explanation, and such an 
explanation therefore is not logically necessary in order to under- 
stand the existence of point-spread betting. 


Vil 


In this section the location-family-of-market-beliefs assumption is 
dropped, individual betting is supposed consistent with the positive 
EMV requirement but is otherwise arbitrary, and a very simple exam- 
ple is presented in which point spreads versus odds is no contest. Only 
the point-spread and odds games are compared, and there is no 
attempt to obtain simple or general optimality conditions. The exam- 
ple is presented to illustrate how point spreads can be favored over 
odds when bettors use an arbitrary EMV -based decision criterion. The 


7x* is a global mode of G if g(x*) = g(x) for all x. 
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more specific criterion used in the previous sections, while influencing 
the particular optimality conditions, is not therefore crucial for the 
bookie to favor one betting game over another. 

Suppose N consists of just two potential bettors. Let c = 1/9 and 
suppose beliefs are given by the uniform distributions, 


Sos ee een! 
F(x) 1 iii ae ou 
FG) ee ~- Sars 


These distributions are depicted in figure 1. Observe that the dis- 
tributions do not differ by just location and are not therefore covered 
by the previous sections. Also note that there are disparate be- 
liefs both in the sense that F, # F, and also in the sense that 
beliefs regarding the event X < 0 are much less dispersed than 
beliefs regarding, say, X < 2. 

The EMV s of the Rams and Bears bets for both individuals given 
the point-spread game are graphed in figure 2 as a function of s. 
From the figure one sees that when 4/3 = s = 24/9, individual | is a 
Bears bettor and individual 2 is a Rams bettor. The equilibrium point 
spread will therefore be somewhere in this interval, with the particu- 
lar spread being dependent on each individual’s betting criterion. 
Under the criterion of Section III the equilibrium spread is 2; if 
individuals purchase a quantity of wagers which is proportional to the 
EMV of the wager, then s = 16/9. But in any event both potential 


1 i Fo 


1 


Fic. 1.—Beliefs of potential bettors 1 and 2 





Fic. 2.—EMVs of Bears and Rams bets for potential bettors given point-spread game 
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Fic. 3.—EMVs of Bears and Rams bets for potential bettors given odds game 


bettors will participate in the betting market by purchasing some 
point-spread bets. 

The EMVs of the Rams and Bears bets given the odds game are 
graphed in figure 3 as a function of z. For z < 2/9 the EMV of a Bears 
bet is positive for both individuals and for z > 5/18 the EMV of a Rams 
bet is positive for both individuals; hence, the equilibrium odds can- 
not be in either of these intervals. The only way to satisfy the equilib- 
rium condition is to let 2/9 S z S 5/18, and with any such odds both 
individuals purchase no bets. Therefore point spreads versus odds is 
no contest. If the point-spread device had not been invented and 
wagers could only be made on the winner of the game, this football 
betting market would not exist. 


Vill 


So what is the answer to the point-spreads versus odds question? 
Perhaps the answer is that point spreads are more fun than odds. 
Since this, however, is not the only logical possibility, the answer 
ultimately must be based on empirical evidence. In a world where 
point spreads are not considered more fun than odds, point spreads 
can exist as the game produced by a profit-maximizing bookie. Dif- 
ference of opinion, besides leading to the formation of a betting 
market, can also have implications for the particular betting game 
produced by the bookie. Finally, to profit from these findings you 
should look for an informal setting in a location-family-of-beliefs 
world where bettors use a criterion as described in Section III. If a 
profit-maximizing bookie is producing the point-spread game, you 
might then wager that your fellow bettors have expectations equal to 
their modes. 
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Two economies, represented by Diamond-type overlapping- 
generations models and differing only in their pure rates of time 
preference, are joined together. Capital formation, balance-of- 
payments behavior, and welfare are compared under autarky and 
openness. With a positive natural rate of growth, the low-time- 
preference country runs a current account surplus in the steady state 
but not necessarily outside it. If preexisting capital is not shiftable 
between countries, integration in the world economy makes the 
high-time-preference country worse off in the short run. The rank- 
ing of stationary utility levels under autarky and openness is am- 
biguous. 


I. Introduction 


The purpose of this paper is to explain international capital move- 
ments from differences in time preference and to evaluate the short- 
run and long-run welfare implications of a change from a situation of 
trade and financial autarky to one of openness in trade and finance. 
The analysis is conducted in terms of a deterministic model of two 
countries, each one of which consists of a Samuelson-Diamond type of 
overlapping-generations model and produces one identical good. All 


I would like to thank Jeff Carmichael, Jon Eaton, Smith Freeman, and Paul Krugman 
for useful comments. Two anonymous referees suggested significant alterations in an 
earlier version of the paper that were very helpful. The research reported here is part 
of the National Bureau of Economic Research’s Program in International Studies. Any 
opinions expressed are those of the author and not those of the NBER. 


[Journal of Political Economy, 1981, vol. 89, no. 4] 
© 1981 by The University of Chicago. 0022-3808/81/8904-0006$01.50 


769 


770 JOURNAL OF POLITICAL ECONOMY 


determinants of international trade and international lending and 
borrowing other than a single taste parameter, the pure rate of time 
preference, are ignored. There are no differences in technology, in 
exogenous factor endowments, in scale, and in market structure; nor 
are there taxes, subsidies, and other man-made wedges between 
buyers’ and sellers’ prices or between domestic and foreign prices. 
There is an extensive literature on the determinants of interna- 
tional capital movements. Starting from the accounting identity that a 
country’s net international capital outflow equals the difference be- 
tween national saving and domestic capital formation, a number of 
different approaches have been taken to the determinants of saving 
and investment. Ad hoc saving functions, mainly specifying saving as 
a constant fraction of disposable income, are the most common (Borts 
1964; Neher 1970; Fischer and Frenkel 1972; Onitsuka 1974; Hori 
and Stein 1977; Ruffin 1979). Stiglitz (1970) derived saving behavior 
from the utility-maximizing choices of infinite-lived individuals. The 
approach adopted here and in Fried (1980) has optimizing, overlap- 
ping generations with finite lives. This is especially useful for the 
short-run and long-run welfare analyses of a move from autarky to 
openness. Capital formation is carried to the point where the margi- 
nal product equals the interest rate. This is different from the 
exogenous investment of Neher (1970) and Onitsuka (1974), the ad 
hoc (Keynesian) investment function of Borts (1964), and the cost- 
of-adjustment investment function of Fischer and Frenkel (1972). 
Only a single good, which can be consumed or used as a capital good, 
is produced in both countries. This is the same as in Neher (1970), 
Onitsuka (1974), and Ruffin (1979). Stiglitz (1970) models production 
of and trade in distinct consumer goods and capital goods, as do 
Fischer and Frenkel (1972). Hori and Stein (1977) have distinct con- 
sumer goods and capital goods, but each of the two countries in their 
model is assumed to specialize completely in production of one of 
these goods. Borts (1964) distinguishes between traded and non- 
traded (labor-intensive) goods; traded goods are imports and exports; 
the traded goods can be either consumer goods or investment goods. 
The one-commodity model cannot be used to analyze the effect of 
capital mobility on the terms of trade, a concern of Borts (1964), 
Suglitz (1970), and Hori and Stein (1977). The one-commodity 
structure also means that trade in goods must be matched by interna- 
tional financial mobility. Stiglitz (1970) has trade in consumer goods 
and capital goods but no financial capital mobility. Fischer and Fren- 
kel (1972) have international trade in capital goods, in consumer 
goods, and in financial claims—as does Borts (1964), although there 
all capital goods are imported. Finally, this paper considers a two- 
country model, as do Stiglitz (1970), Hori and Stein (1977), and Ruffin 
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(1979). The small-country case was analyzed by Borts (1964), Neher 
(1970), Fischer and Frenkel (1972), and Onitsuka (1974). In a recent 
paper, Findlay (1978) has approached the problem of international 
trade, time preference, and interest equalization from an “Austrian” 
viewpoint. This paper can be viewed in part as a neoclassical comple- 
ment to Findlay’s analysis. Section II describes the autarky equilib- 
rium of the two countries. Section ITI analyzes the open, integrated 
world economy. 

One of the conclusions of the positive analysis, that in the steady 
state (but not necessarily outside it) the high-time-preference country 
runs a current account deficit, is hardly surprising and deserves 
mentioning mainly because of its rigorous foundation in optimizing 
behavior. The welfare analysis of autarky versus openness in Section 
IV presents some rather less conventional results. If preexisting cap- 
ital is nonshiftable between countries, the high-time-preference 
country is worse off under openness in the short run. The long-run 
welfare analysis is more complex but also fails to generate unambigu- 
ous welfare improvements for both countries under openness. 


II. The Autarky Equilibrium 


The world economy consists of two countries, the “home” country 
and the “foreign” country, identical in every respect except in taste. 
Each country is represented by competitive output and factor mar- 
kets, two overlapping generations, and an identical, well-behaved 
constant returns to scale production function f, a model due to 
Diamond (1965).! 


Notation 


c{ = consumption while young by a member of generation ¢; 
c? = consumption while old by a member of generation ¢; 
L, = size of generation ¢; 


K;, = capital stock in period ¢; 

k, = K/L 

w, = real wage in period ¢; 

r, = interest rate on savings carried from period ¢ — 1 into 
period ¢; 


n = one-period proportional rate of growth of population L;4, 
u(t on gn Us 

p = pure rate of time preference; p 2 0; 

A, = wealth in period ¢; 


1 The original development of the overlapping-generations model is by Samuelson 
(1958). See also Cass and Yaari 1966. 
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Qi = A,/Li; J 

B, = trade balance surplus of the home country in period ¢; 
by =B/Li; 

G; = current account surplus of the home country in period ¢; 
Sa G,/L;. 


Momentary Equilibrium 


The momentary or single-period equilibrium of the closed economy is 
summarized in equations (1)—(7): 


ee u (Ch. Gt), (1) 

Wer ter = cil ch tesa), (2) 
ci, c? = 0, (3) 

we =f (he) — Ref" (he), (4) 
re =f" (hi), (5) 

we — Ct = key +n); (6) 
k, = 0. (7) 


The utility function, u, is twice differentiable, strictly quasi-concave, 
and increasing in c' and c?. The quantity u,(0, c?) = u2(c!, 0) = +%; 
U,(®, c”) = ug(c', ©) = 0. The function f is twice differentiable; f(0) = 
O57, = 0577 = 0; f/(0) = + and f/(~) = 0, 

Individuals within a given country are identical within and across 
generations. People live for two periods, work in the first period of 
their lives, and retire in the second. Labor supply is inelastic. The 
individual’s optimization problem is given in equations (1), (2), and 
(3). The utility of lifetime consumption is maximized subject to the 
lifetime budget constraint (2). The conditions on u ensure that the 
budget constraint will hold with equality and that an interior solution 
will be obtained for c/ and c?. Equations (4) and (5) state that the labor 
market and capital rental market are competitive. The capital stock at 
the beginning of period t + 1 equals the value of the saving in period 
t. This economy-wide capital market equilibrium condition is given in 
equation (6). Real capital is the only store of value. All saving in period 
tis performed by the young members of generation t. The old dissave 
previously accumulated wealth. The conditions onf ensure that k, will 
be positive. 

When we consider the consequences of different pure rates of time 
preference, the utility function u(c}, c?) will be specialized to the 
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following additively separable form: 
u(ct, C2) = v(ct) + (1 + p)*v(c?). (8) 


Here p is the (constant) pure rate of time preference. More generally, 
the pure rate of time preference is the marginal rate of substitution 
between consumption when young and consumption when old, when 
equal amounts are consumed in both periods, minus one. It can be a 
function of the level of consumption: p(c) = [u,(c, c)/us(c, c)] — 1. 

The general solution of (1)-(7) is straightforward and can be 
found, for example, in Diamond (1965). The interior first-order 
conditions for the individual are: 


u(Ct, C?) 


ua(ct, C2) = 1A: (9) 


For the utility function of equation (8), this becomes: 


v'(ci)(1 + p) 


v'(¢) = oars (9') 


The consumer’s intertemporal equilibrium is graphed in figure 1. 
Equations (9) and (2) (the latter holding with equality) can be solved 
for c? and c? as functions of w; and 7444: 


Ct = c'(wy, Ye41), (10a) 
cf = c*(wyz, Te41).7 (10b) 
Similarly, equations (9’) and (2) can be solved to yield: 
Ct = C' We, TH1, P) (10a’) 
Cf = (Wi, T1441, P): (10b’) 


If consumption in both periods is a normal good, 1 > ci,, c2, > 0. We 
shall assume this to be the case. 














* ac} = ~ [ere = (1 + 141) o0] 
Ou; [try — (1 + %¢44)tor (1 + 1441)? — ter + (1 + 7441) tlo2 
Oct _ Us — (1 + Te41) "Lor — (1 + Te41)Ue22] 
O41 Uy — (1 + 74¢41)[2te1 — (1 + 7441) Ul22] 
OC _ Oc 
aaa + re)(1 ae ) 
Oc? 0c} G 
= = (Ul ae ——— + ——_—_—_.. 
Ori ( fer) Ores 1 + Te 
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consumption 
and income 
in period 2 


) 


wt ( Tele 


we (FE) 


a i Wt consumption 
and income 
in period 1 


Fic. 1.—The consumer’s intertemporal equilibrium 


Steady-State Equilibrium 


The steady-state or long-run equilibrium is that sequence of momen- 
tary equilibria in which the lifetime per capita consumption profile 
remains constant from generation to generation. It can be charac- 
terized by the following set of equations: 


s(C*, Go) = 
err 2) =e) ae (11) 
D(C Ces pan 
ge a (11') 
ee ee a2) 
w = f(k) — hf'(h), (13) 
tase 7: (h), (14) 
wo =k en), (15) 


By substituting equations (13) and (14) into (12) and (15), we obtain 
two equations that represent the stationary decentralized (or compet- 
itive) consumption possibility locus: 
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Uslope -(1+f (k}>k =) 
Ba 


ED 


—— slope -(1-n) = -(1r f (ks >ko)) 


—— slope -(1*f (kg) 


0 


Fic. 2.—The stationary competitive possibility locus 


2 


f(k) — bf'(h) = 08 + TF (16a) 
FR) —Bf(k) = c4 +k(1 +72). (16b) 

These two equations can be solved for c! as a function of c?: 
ch = p(c)); yi = - tn +) (17) 


(Vanja fF = ky”) 


For the Cobb-Douglas production function f(k) = k*, 0 < a < 1, the 
stationary decentralized consumption possibility locus has the shape 
of OF in figure 2.3 Along this locus: (dc'/dc?) = —{[1 +n + (a — 
l)ak*"]/(1 + n)(1 + a?k?~')}. At the origin (with k = 0), (dc?/dc') = 
[a/(1 — a)](1 + n) > 0. The capital-labor ratio, k, increases monotoni- 
cally as one moves from O to F. Ask approaches infinity (which would 


* The historical origins of fig. 2 are not completely clear. It was a well-established part 
of the “Yale blackboard tradition” when I came to Yale as a graduate student in 1971. 
James Tobin suggests that David Cass may have been the first one to utilize it. It has 
since been used as an expository device by Ihori (1978), Buiter (1979), and Carmichael 
(1979). 
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be for c! < 0), (dc?/dc') becomes — 1. Whenc! = OatF, (dc?/dc1) = —[1 + 
o?(1 + n)(1 — a) '\(1 — a) < 0. At the golden-rule capital-labor 
ratio, defined by f’(k) = n, (dc?/dc') is —[1 + f’(k)] = —(1 + n).4 I shall 
assume in what follows that OF is as in figure 2: strictly concave 
toward the c* axis for nonnegative values of c! and c”. This is certainly 
the case for the Cobb-Douglas production function but need not be 
true for alternative, otherwise well-behaved neoclassical production 
functions. It is, for example, quite possible for OF, while remaining 
strictly concave toward the origin, to be upward sloping at F. This 
would reflect extreme overaccumulation of capital: An increase ink at 
high values of k depresses the marginal product of capital to such an 
extent that reductions in both c' and c? are required to generate the 
additional saving required to maintain the higher value of k. 
A stationary competitive equilibrium is described by equations (16a) 
and (16b) and a tangency of a private budget constraint with slope 1 + 
Jf'(k) = 1 +, to an indifference curve, as given in (11) or (11’). At 
capital-labor ratios above the golden-rule level, the budget constraint 
cuts the OF curve from above if the production function is Cobb- 
Douglas. In what follows I shall assume that this condition holds. 
In figure 2 I have drawn such an equilibrium at E;, corresponding 
to a capital-labor ratio in excess of the golden-rule capital-labor ratio. 
This stationary competitive equilibrium is inefficient in the sense that 
any lower capital-labor ratio that does not lie below the golden-rule 
capital-labor ratio can sustain higher stationary paths of per capita 
consumption and does not require any sacrifice in consumption to 
move to these higher stationary paths if only physical resource con- 
straints are taken as binding. The physical resource constraint faced 
by an omniscient and omnipotent social planner is c?_,L;, + cll; < 
SF (he)Lt + Ki — Key1, or 


cz_1(1 SEP) Shc, Sf (kt) TS Rei] A) « (18) 


The stationary resource constraint is: 


* In what follows I shall assume that the golden-rule capital-labor ratio is less than the 
capital-labor ratio at F. This need not be the case. From (16b) it is clear that with a 
Cobb-Douglas production function, c! = 0 implies either k = 0 (at the origin) or (1 — 
ake" = (1 + n). (This requires that k be less than unity at F.) Thus, at F, A! = (1 + 
n)/(1 — a). At the golden rule, k* = n/a. It is likely, but not inevitable, that k at F 
exceeds the golden-rule capital-labor ratio. 

° The slope of the consumption possibility frontier, S,, is given by: S, = [(1 + n)(1 + 
oa? k*1)]/[(1 — a)ak* — (1 + n)]. The slope of the budget constraint, S5, is given by S, = 
—(1 + @k*"). When f'(k) = ak™ <n, both are negative. 


Ute (let Ochi 


S => 
1/82 1+n+ (1 +n)a?k*! + (1 — a)ak™'(n — ak) ~ 


This is < 1 ifn > ak*. Tam obliged to an anonymous referee for saving me from error 
here. 
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Cc? 
l1+n 





cee yt) nk: (19) 


A social planner aiming to maximize the stationary value of per 
capita resources available for consumption by the young and the old 
chooses & such that f’(k) = n. The social planner’s stationary con- 
sumption possibility locus is S$; — S, in figure 2. The decentralized 
competitive economy will settle somewhere on OF if the economy has 
a stable equilibrium.® An equilibrium such as £, in figure 2 represents 
a steady-state capital-labor ratio below the golden rule. This cannot be 
characterized as inefficient because of (temporary) sacrifices in con- 
sumption that would be required to reach a higher capital-labor ratio. 


Stability 


Substituting (10a) into (6) and using (4) and (5), we obtain a first-order 
difference equation in k; that describes the evolution of the model 
from arbitrary initial conditions: 


She) — Ref (Re) — Lf Ae) — heft), f (Reta) = Are +). (20) 
The model will be locally stable when | (dky+1/dk,) | < 1, that is, when 
| (cL, — DA"(L tnt+ef"y'| <1. (21) 


This says that a higher value of the capital stock in period ¢, k,L;, 
should not boost saving by young members of generation ¢ by more 
than the increase in k,L;. Note that the assumption that goods are 
normal (0 < c;, < 1) and the assumption that an increase in the 
interest rate reduces current consumption (c; < Q) are insufficient to 
ensure stability. 


Properties of the Autarky Equilibrium 


We can now establish five propositions about the autarky equilibria of 
two economies that are identical in all respects except for their pure 
rates of time preference. The utility function is given in equation (8). 

Proposition 1.—Given w; and 7;4,, a consumer with a higher pure 
rate of time preference will choose a higher value of consumption 
when young, ¢. 

Proof.—We want to show that the consumption function ¢j = c’(u,, 
+1, P) has the property ch = (dc'/dp) > 0. Differentiating {(1 + 
p)v'(c!)/v'[(w — c1)(1 + r)]} = 1 + 7, we obtain 


§ The assumptions we have made so far suffice to ensure the existence of a long-run 
equilibrium (Carmichael 1979). A (probably overly strong) sufficient condition for the 
uniqueness of the steady-state equilibrium is that preferences be homothetic (Car- 
michael 1979). 
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$= HO berslaro STR 


The utility function is increasing in c' and c”, so v' > 0. Strict convex- 
ity of the indifference curve requires v" < 0. Therefore (0c'/dp) > 0. 

Proposition 2.—When one starts from a given k; (and therefore a 
given w;), a higher pure rate of time preference is associated with a 
lower value of k;,,; (and a higher value of c/) if a higher interest rate 
does not reduce saving. 

Proof.—¥or the case of the additively separable utility function, 
equation (20) can be rewritten as: f(k,) — ke f’(Re) — c'Lf (Re) — Ae f' (Rt), 
F' (Re+1), P] = ki(1 +n). Thus (0k;,,/0p) = 4 Cp(l lan = rf" (Re+1)]}- 
A sufficient, but not necessary, condition for (0k;4,/0p) to be negative 
iN Gs =< ©. 

Proposition 3.—Given p, a lower value of k; will be associated with a 
lower value of k,,, if consumption in period 2 is a normal good and a 
higher interest rate does not reduce saving. 

Proof.—From (20) we obtain (0 y4,/0k,) = {-((1 — chiki f"(Ae IL + 
nm + cr f"(Re+1)]}. This will be positive if ci, < 1 and c} < 0. (Note that c}, 
a oe) 


Propositions 2 and 3 immediately imply proposition 4. 

Proposition 4.—Consider two economies, identical in all respects 
except the pure rate of time preference, which is higher for the home 
country than for the foreign country. When one begins from any 
common initial capital-labor ratio at t = to, the capital-labor ratio of 
the high-time-preference country will be below that of the low-time- 
preference country for all t > t) if consumption in period 2 is a normal 
good and a higher interest rate does not reduce saving. 

Proposition 4 tells us something about the behavior of the two 
economies were they to start from the same initial condition. It might 
still seem possible that if the high-pure-rate-of-time-preference 
country were to start off with an initial capital-labor ratio, Rtg 
sufficiently above the initial capital-labor ratio of the low-pure-rate- 
of-time-preference country, kj, k; could remain above ki indefinitely. 
To show that if k;, > k;, the two paths of the capital-labor ratio {k;} 
and {k,} will cross, it is necessary to show that the steady- state 
capital-labor ratio of the high-time-preference country, k, is less than 
the steady-state capital-labor ratio of the low-time-preference coun- 
try, k, and that the model is globally stable. Proposition 5 states the 
conditions under which the steady-state capital-labor ratio is a de- 
creasing function of the pure rate of time preference. Sufficient 
conditions for global stability are given in Carmichael (1979). After 
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the two paths cross for the first time, they never cross again because of 
propositions 2 and 3. 

Proposition 5.—Under autarky, the country with the higher pure 
rate of time preference will have the lower steady-state capital-labor 
ratio if the model is locally stable, consumption in both periods is a 
normal good, and an increase in the interest rate lowers consumption 
in period 1.7 

Proof.—Consider again the consumption function c} = c(w;, T1444, 
p). From proposition 1 we know that c} > 0. We find the effect of a 
higher value of p on steady-state k from f(k) — kf'(k) — c! = k(1 +n). 
Differentiating this yields: (0k/dp) = ch[(ci, — lkf" — (ch f” + 1 + 
yl ewith¢, 0 (Ok/Op y= requires (¢,)—91)ky = (en f" 1 ny 
0. This is ensured by stability (eq. [21]) if 0 <c, — l andc! S 0. 


III. The Two-Country Equilibrium 


The two countries that will be linked together in an international 
commodity market and an international capital market are identical 
in all respects, except for the pure rate of time preference. The home 
country is assumed to have a higher rate of pure time preference than 
the foreign country. For the home country, we adopt the notation 
developed for the autarkic economy. All variables associated with the 
foreign country are distinguished by a tilde above the relevant vari- 
able. Comparing the two countries, we have: 


I=); (22a) 
= (22b) 
n=n, (22c) 
(aye (22d) 
DEAD. (22e) 


Output can be moved costlessly between the two countries. 

In this “pure-absorption” model,’ international trade and interna- 
tional lending and borrowing (international capital mobility) are part 
and parcel of the same transaction. In a one-commodity model, the 
only way to pay for an extra unit of output today is with a promise of 
future output. Each trade balance transaction has to involve credit. 


7 Since only local stability is assumed, proposition 5 is valid only for small differences 
between the two countries’ pure rates of time preference. 

8 The model is easily generalized to include scale differences, as long as the natural 
rate of growth in both countries is the same. 

°I owe this description to Bill Branson. 
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There can be no current quid pro quo. We make the convenient 
assumption of perfect international mobility of financial capital: 
Ownership claims to domestic and foreign real capital are perfect 
substitutes in private portfolios in both countries. This means that 
interest rates will be equalized in the world economy: 


Tee OVeeais (23) 


There is a single perfect world financial capital market and a single 
perfect world market for the current production of the homogeneous 
commodity that can be used as a consumption good or a capital 
good.'° With international capital mobility, it is essential to distinguish 
between the capital stock used in production in the home country, K;, 
and the value of claims on real capital, domestic or foreign, owned by 
domestic residents, 4;. In the national income accounts the corre- 
sponding distinction is between domestic income or product and 
national income or product. Location and ownership of physical cap- 
ital no longer coincide. From interest equalization, free trade in capi- 
tal goods, and identical, linear homogeneous production functions, it 
follows that capital-labor ratios and wage rates are equal in the two 
countries: 


ky =k, (24a) 
Wy = Wy. (24b) 


The behavior of the integrated world economy is given by the op- 
timizing behavior of domestic and foreign households, factor market 
equilibrium conditions, and the world capital market equilibrium 
condition. Domestic household behavior is given by equations (1) (or 
its additively separable version [8]), (2), and (3). Foreign household 
behavior is represented by the same functions, with é}, 2?, and 6 
substituted for c?, c?, and p, respectively. Factor market equilibrium in 


both countries is given by (4) and (5), the world capital market equilib- 
rium condition by 


we ~ 5(6l + El) = hen +). (25) 


Openness is reflected in the equalization of interest rates and of 
marginal products of capital. Equation (25) states that world saving in 
period t, (w; — ci)L; + (W; — é)L, equals the world capital stock in 
perioditiel Ket Kay =Anw ede 

The individual optimization programs yield, in the additively 


‘° When the short-run gains from international economic integration are considered, 
it is assumed that the preexisting national capital stocks cannot be relocated. Newly 
produced capital goods can be located costlessly in either country. 
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separable case, equation (9’) as the first-order condition for the home 
country. Replacing ¢?, c?, and p by Zé, é?, and , (9’) also represents the 
first-order condition for the foreign country. Given the predeter- 
mined value of k, (= k;), the model determines the momentary equilib- 
Five values Of €7,67,.07) Cr. Wy, Treg, ANd Kia: 

The stability conditions for this world-economy model are very 
similar to those for the closed economy given in (21). As in the closed 
economy, we can solve the first-order conditions and household 
budget constraint for c/ as a function of w;, 7441, and p, and for ¢} asa 
function of w;, 7741, and p. As in the autarky case, we can now obtain a 
first-order difference equation in k¢: 


Fie) — haf (he) ~ 5 {OL fib) = Add, fess 


+ CL f (he) — kif’ (he), f' (Re+1), Pl} =hiil+n). (26) 
This model will be locally stable if 
(eh Heh — 2a 1201 + n) + (ch + DPT | < 1.8 (27) 


The interpretation of this stability condition is the same as in the 
closed-economy case. Again, the assumptions of normal goods and a 
nonnegative effect on saving of an increase in the interest rate do not 
suffice to establish stability. Note that, when tastes are identical, the 
model reduces to the closed economy. All international trade and 
international borrowing and lending are due to taste differences. 
Combining the rate of time preference with finite-lived individuals 
in a Samuelson-Diamond type of overlapping-generations model may 
be the only analytical way to introduce different pure rates of time 
preference in models with international trade in goods and financial 
claims. If an economy consists of infinite-lived individuals who 
maximize the sum of the infinite stream of utility (or consumption) 
discounted by their rate of time preference, then in the steady state 
the interest rate equals the pure rate of time preference because 
consumption does not vary over time. Therefore, if the world consists 
of two such economies with different rates of time preferences, there 
cannot be a steady state with international mobility of financial capital. 
Differences in the rates of time preferences imply differences in 
steady-state interest rates, as in Stiglitz (1970), whose model indeed 
does not have financial capital mobility, while financial capital mobility 
implies an identical steady-state interest rate. The use of the 
overlapping-generations model solves this problem since, even in the 


11 Using the more general utility functions u(c}, c?) for the home country and a (G7, é?) 
for the foreign country, we can solve for c} and é} as functions of w; and 74. The 
stability analysis carries through unchanged for this more general case. 
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steady state, first-period consumption need not equal second-period 
consumption (even though each generation’s lifetime consumption 
profile is the same), and thus the interest rate need not equal the rate 
of time preference.” 


The Balance-of-Payments Accounts 


In a two-country world, it suffices to analyze the balance-of-payments 
accounts of one of the countries. Without loss of generality, I focus 
on the home country: 


B,=—B, (28a) 
and 
G,= —-G,. (28b) 


As there is no official settlements balance, the current account surplus 
(deficit) of a country is identically equal to its capital account deficit 
(surplus). 

The balance-of-trade surplus is the excess of domestic product over 
domestic absorption. Domestic absorption is the sum of consumption 
and domestic capital formation: B, = Lf (ky) — ct LL; — cf Li-1 — (Kes — 
K;), or 


2 
be = f(he) — ch = — [1 + myers — Bel. (29) 
n 
The current account surplus is the excess of national product over 
domestic absorption. National product equals domestic product plus 
net foreign investment income. If 4; 1s the wealth of the home country 
at the beginning of period ¢, net claims on the rest of the world are 4; 
— K, and net foreign investment income 7;(4; — K;,). The country’s 
wealth at the beginning of period ¢ consists of the accumulated saving 
of the members of generation ¢ — 1: A; = (wi-y — ch4)Li-1, or 
nee Cres 
eTire ap te ome ed) 
National wealth can, and with different pure rates of time preference 
will, differ from the value of the domestic capital stock because of the 
scope for international borrowing and lending. There is a presump- 
tion that a nation consisting of people with high rates of time prefer- 
ence will tend to be a net foreign borrower. The truth of this pre- 
sumption will be investigated in what follows. 


 T am indebted to an anonymous referee for making this point. 
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The current account surplus of the home country is given by G; = 
Lif (he) + 1¢(4t — Kz) — fly — Galt) — (Kir — Kz), or 


& =f (ke) + (a, — ki) ee — [Cl + 2)ke1 — &). 31) 
With constant returns to scale in production and marginal produc- 
tivity factor pricing, f(A;) = w, + rk;. Equation (31) can therefore be 
rewritten as: 


2 
Ct-1 


= wy + 4a — ce — —— 
&t t tat t eos 


— [1 + n)ker1 — Rel. (31’) 
Equivalently, the current account surplus can be viewed as the net 
foreign investment by the home country. Net foreign investment is 
the excess of domestic wealth accumulation (saving) over domestic 
capital formation, that is, G; = Ay4, — A; — (Kis, — Ky), or 


pee ip [C14 a)heer, Fy (32) 


Steady-State Equilibrium 


The steady-state equilibrium of the two-country model is given by 
(11’) and (12) for the home country and their counterparts for the 
foreign country, the factor market equilibrium conditions (13) and 
(14), the stationary world capital market equilibrium conditions (33), 
the definition of home country net worth (34), and the balance-of- 
payments equations (35), (35'), (36), and (36’): 








w — (ch +0) = ACI Hey) (33) 
b =f(k) —c!— ae — hk, (35) 

or 
b= (n—TVG—k), (35") 
Cm fhe — hk) =e. — _ — nk, (36) 
g=n(a—hk). (36’) 


Neither the steady-state trade balance nor the steady-state current ac- 


3 Ara = (we— cL Qi. = Ate /Lig3 Chibi =A(1 +7) or c#4/(1 + n) =a,(1 + 
heat, We OV aa Cf = En nye =P ay) a}: 
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count balance need be in equilibrium. As regards the current account 
balance, this is the result of a nonzero rate of population growth and 
different tastes. If, for example, the home country is a net lender to 
the rest of the world (a > k), steady-state equilibrium requires a 
current account surplus sufficient to maintain the real per capita 
value of net claims on the rest of the world, that is, a surplus of n(a — 
k) (eq. [36’]). A steady-state current account surplus will be associated 
with a steady-state trade balance surplus if the interest rate is less than 
the rate of population growth, with a steady-state trade balance deficit 
otherwise (35’). 

Because the real wage and the interest rate are equalized through- 
out the world economy, the present discounted value of lifetime 
resources and lifetime consumption is equal for residents of both 
countries. If and only if the two countries have identical lifetime 
consumption paths (c! = @! and c” = @?) will there be no net foreign 
lending or borrowing. Each economic agent “lives within his means” 
in the sense that the present discounted value of lifetime resources 
(period 1 labor income) equals the present discounted value of 
lifetime consumption. Still, except when tastes are identical or popu- 
lation is stationary, one country will be steadily accumulating claims 
on the other. 

We next establish the following three propositions about the 
steady-state equilibrium of the world economy. 

Proposition 6.—The country whose residents have a higher value of 
consumption when young will run a steady-state current account 
deficit. 

Proof.—From (34) we know that a = [(w — c!)/(1 + n)] anda = [(w — 
¢1)/(1 + n)]. Thus c! > ¢? implies a < a. Rewrite (33) asa —k +a —k= 
0. Therefore a — k = —(@ —k) < 0. The current account surplus is n(a 
—k) <0. This proposition, unlike the following two, does not depend 
on the additively separable specification of the utility function. 

Proposition 7.—The country with the higher pure rate of time pref- 
erence has a steady-state current account deficit. 

Proof.—From proposition 6 we know that it suffices to show that p 
> p implies c' > ¢'. This was shown in proposition 1. 

Proposition 8.—The common steady-state open-economy capital- 
labor ratio lies between the two autarky capital-labor ratios if the 
model is stable under autarky and if the conditions of proposition 5 
are satisfied. 

Proof.—Let k* be the common open-economy capital-labor ratio, ky 
the autarky domestic capital-labor ratio, and ky the autarky foreign 
capital-labor ratio. We again assume that p > p. For brevity we write 
steady-state consumption in period | asc'(k, p) and ¢'(k, p), with cp, ch 
> 0. From the fact that consumption in period 1 is an increasing 
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function of p, it follows that 21(k, 6) < %[c(k, p) + €(k, p) | =c (ep). 
To establish the truth of proposition 8 we must rank the k’s deter- 
mined by the following three capital market equilibrium conditions: 
Sf (ku) — Ruf’ (hit) — c'(kn, p) = ku (1 + 1); f(R*) — RAP (RE) — Ya {lcl(R*, 
p) + e(h*, p)]} = A¥(1 +n); and f (he) — kof’ (ke) — 21(kp, p) = hp (1 +0) 
Assume that k* is below the lower of the two autarky capital-labor 
ratios, ky. In the autarky case, at a value of k below ky such as k*, there 
would be excess demand (i.e., excess saving) in the capital market—if 
the model is stable. A fortiori there would be excess demand if, in the 
home country capital market equilibrium condition, c'(k*, p) is re- 
placed by the smaller world-average level of consumption ¥4[c!(k*, p) 
+ ¢*(k*, p)]. Thus capital market equilibrium requires k* > ky. By 
exactly analogous reasoning, it is established that k* < kp. 
Diagrammatically, the stationary equilibrium of the two-country 
model can be represented as in figure 3. Here OF is the stationary 
decentralized consumption possibility locus for the hypothetical aver- 
age inhabitant of this two-country world economy, that is, someone 
with a lifetime consumption pattern )4(c! + é1), Ya(c? + é). It incorpo- 
rates the world capital market equilibrium condition (33) and the 
hypothetical average world citizen’s budget constraint, the sum of the 


tee 4 (<6?) 






=— slope -(1+n) 


slope -(1+f'(k)) 
Y=) 1/421 
0 pcs zc 46.) 


Fic. 3.—Autarky and open-economy equilibria for both countries 
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two countries’ private budget constraints. That is, OF represents 4(c' 
+ ¢') as a function of 4(c? + ¢), solved from: 


f(k) — Bf'(h) — alc! eye yel oy (37) 


Vol ete ic?) 


' lia 88 Ss 
F(k) — BEA) — ale aa ae 


(38) 


The OF also represents the stationary autarky consumption possibility 
locus for each country individually. 

While (37) and (38) must be satisfied in a full stationary equilibrium 
of the world economy, they are not a complete characterization of the 
stationary decentralized consumption possibility locus. Residents of 
each country must satisfy their budget constraints individually and 
not merely on average, across both countries as in (38). In figure 3, 
the indifference curve of each country is tangent to the common 
world capital market line. The home country, after integration into 
the world economy, consumes at E7. Its high level of c! is a reflection 
of its high pure rate of time preference. The FE} is the open-economy 
consumption equilibrium of the foreign country with the low pure 
rate of time preference. Average world consumption (and world 
production in both countries) is at E fy. Consistency requires that the 
distance Ef — Efy = Ey — EX. The home country borrows from the 
foreign country. From (34) and (33) we obtain that a — k = %4[(é! — 
c')\(1 + n)] < 0. The home country’s debt to the foreign country 
grows at the natural rate of growth. As we have drawn it, Ef) is ata 
capital-labor ratio below the golden rule, that is, r > n. The home 
country therefore runs a trade balance surplus. If the equilibrium had 
been beyond E*, the home country would still run a current account 
deficit, but with n > r it would now also run a trade balance deficit. The 
autarky equilibrium for the home country is at Ej’, ata capital-labor 
ratio below the common posttrade capital-labor ratio. The autarky 
equilibrium for the foreign country is at E#", at a capital-labor ratio 
above the common open-economy capital-labor ratio. 

Corollary.—The steady-state real wage of the capital-exporting 
country is below its autarky real wage and conversely for the 
capital-importing country. This confirms the analysis of MacDougall 
(1960), Kemp (1962), and Ruffin (1979). 


Non-Steady-State Behavior 


The behavior of the two-country model outside the steady state is not 
much more complicated than its steady-state behavior. There is only 
one state variable—the capital-labor ratio. It is easily seen from equa- 
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tion (39) below that the current account and trade account surpluses 
in period ¢ can also be expressed as a function of k,_,. The current 
account surplus of the home country is given by: 


at 
&t =f (Ae) + peat ae = Kil 6; 


oe (39) 

— ——— — [(1 + n) hens — A). 

SO = [0 + hea ~ Bel 

Will the country with the higher pure rate of time preference run a 

current account deficit throughout the non-steady-state adjustment 

process, the way it does in the steady state? Corresponding to (39) is 
the current account surplus equation for the foreign country: 


el - 
a ki) — 2 
(40) 


ne 
Ct-1 
— ~~ — [(1 + n)hes1 — hy). 
Ho (1 + nko ~ hel 
In this two-country world g; + g = 0. Comparing (39) and (40), we see 
(Hate with 9) =*p,6. =o}, C7 = t}_y,and' <2; > ch. The. excess of 
period 1 consumption in the home country over period | consump- 
tion in the foreign country makes for a current account deficit in the 
home country. The excess of period 2 consumption in the foreign 
country over period 2 consumption in the home country has the 
opposite effect. The home country will run a current account deficit if 
and only if 
r(Ct-1 — Ct-1) leet ft 
+ Ce — G = —————— 4] 
Ll+n ‘ : Ltn oe 
When the private sector budget constraints are used, this can be 
rewritten as 
1 1 Cet Cas ' 
c — ¢ > 41 
; Li ey 
With n > 0 this condition is certainly satisfied in the steady state with c} 
= cl, and Zé} = 1, because c! > @!. Outside the steady state, it will be 
satisfied if the increase in total consumption by the younger genera- 


tion between period ¢ — 1 and period ¢ in the home country exceeds 
that in the foreign country. This can be seen by rewriting (41’) as 


LiGi, haan = Lite = Lye bi=n- (41”) 
The current account deficit equals the excess of domestic capital 


formation over national saving. This holds for both countries. With 
perfect international capital mobility, &; is the same in both countries 
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in each period. Domestic capital formation will therefore be the same 
in the two countries. The home country will therefore run a current 
account deficit if and only if domestic saving falls short of foreign 
saving, that is, if and only if L¢(w; — ¢?) — Li-s(we-1 — Ct-1) < Li(w¢ — C2) 
— Ly-4(wy-1 — Ct-1). This is equivalent to (41”). While, with p > 6, each 
generation in the home country saves less than the corresponding 
generation in the foreign country, national saving in the home coun- 
try need not be less in any given period than national saving in the 
foreign country. National saving is the sum of saving by the young 
and dissaving by the old. In the home country the young save less, but 
as a consequence the old dissave less. 


IV. Gains from Participation in the World Economy 


The positive analysis of the determinants of international trade and 
lending can be complemented in a natural way by the analysis of the 
gains from participation in the international economy. When the 
gains or losses from participation in the world economy are evaluated, 
welfare in the world economy will be compared with the welfare that 
would have been achieved under continued autarky. Both short-run 
effects and long-run effects are considered. 


Short-Run Effects 


Short-run effects are the effects on the welfare of the two generations 
alive during the period that the two national capital markets and 
goods markets are integrated. The two countries evolve under au- 
tarky up to and including period t — 1. International integration 
occurs in period ¢. I shall consider in detail only the case in which 
preexisting capital goods cannot be shifted abroad. In this case, inte- 
gration of capital and goods markets in period ¢ does not permit the 
owners of capital that already exists at the beginning of period t (the 
savings in period ¢ — 1 of young members of generation ¢t — 1) to move 
it from country to country. This physical capital was fixed in place int 
— I to be used in the country of its original location. Because the 
physical capital stocks in the two countries in period ¢ are predeter- 
mined, both the interest rate faced by the old generation (t — 1) and 
the wage rate faced by the young generation are predetermined and 
unaffected by the international integration. In this case it does not 
matter whether the shift toward openness is anticipated or unantici- 
pated. The welfare of the old generation in both countries is there- 
fore the same with openness as it would have been under continued 
autarky. The welfare of the younger generation will be higher (lower) 
under openness than under autarky if the interest rate in period t, 
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Tt+1, is higher (lower) under openness than under autarky. In figure 1, 
a higher interest rate pivots the budget constraint upward around w,;, 
while a lower interest rate pivots it downward. The quantities {k”}, 
{k?}, and {k,} are the sequences of the capital-labor ratios for the 
home country under autarky, for the foreign country under autarky, 
and for the world economy under openness, respectively. The quan- 
tities {77}, {77}, and {r,} are the associated interest rate sequences. It 
is assumed that at the time of integration, t, the capital-labor ratio of 
the home country, k/, is not above that of the foreign country, kf’."4 

Proposition 9.—If kj’ < kj, if consumption in period 2 is a normal 
good, and if an increase in the interest rate increases saving, then kf’, 
< Ress < Riss or roy < tH < rh. 

Proof.—The proof is by contradiction. Assume that 7,4, > 74,. From 
propositions 2 and 3 we know that 77. > 774,. Therefore, if ct < 0 and 
¢1 < 0, saving by generation ¢ under openness in both the home 
country and the foreign country is higher than it would have been 
under autarky. Thus the per capita open-economy saving by genera- 
tion ¢ in the home country exceeds (1 + n)k/4,, and the per capita 
open-economy saving by generation ¢ in the foreign country exceeds 
(1 + n)k?,,. With populations of the same size in both countries, 
average per capita saving by generation ¢ in the world economy 
therefore certainly exceeds (1 + n)k#., (r#}., > rf, implies k#., < k?,,). 
Average per capita saving by generation ¢t in the world economy 
equals the average per capita capital stock in the world economy in 
period ¢ + 1. Therefore ki, > ki4.;. This contradicts the assumption 
that 74, > ri. Exactly analogous reasoning yields the result that k;,, 
< Kiss. 

The effect of participation in the world economy on the welfare of 
the two generations alive during the period in which integration 
occurs is therefore unambiguous. The welfare of the older generation 
in both countries is unaffected. The welfare of the younger genera- 
tion in the high-time-preference country (the home country) is low- 
ered, while the welfare of the younger generation in the low-time- 
preference country (the foreign country) is raised. During period ¢ 
capital flows from the low-time-preference, low interest rate country 
to the high-time-preference, high interest rate country, expanding 
the consumption possibility set of the younger generation in the 
low-time-preference country and shrinking that in the high-time- 
preference country. Since in the high-time-preference country the 
older generation neither gains nor loses from integration and the 


14 This will always be the case either if the two countries started off from the same 
initial condition (proposition 4) or if, from arbitrary initial conditions, the system had 
been evolving under autarky for a sufficiently long period of time (propositions 2, 3, 
and 5). 
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younger generation loses, a move toward integration will always be 
vetoed by the residents of the high-time-preference country if they 
vote according to their preferences—unless an appropriate transfer 
from the low-time-preference country is forthcoming.” 


Long-Run Effects 


It is not possible, except in special cases, to make unambiguous wel- 
fare comparisons by comparing steady-state utility levels—the rank- 
ing of the stationary utility levels may not be the same as the ranking 
of the utility levels achieved during the transition from one steady 
state to another. Nevertheless, the stationary utility levels achieved 
under autarky and openness are of interest. They are analyzed next. 
There are several cases to consider, illustrated in figures 4-9. 

Case 1.—The autarky capital-labor ratios of both countries are 
above the golden-rule capital-labor ratio. In figures 4 and 5, the 
autarky equilibrium of the home country is at E7’, that of the foreign 
country at E}” (proposition 5). The golden-rule capital-labor ratio is at 
E*, The open-economy consumption equilibrium of the home coun- 
try is at Ej, that of the foreign country at EZ. World production 
equilibrium is at Ej,, at a capital-labor ratio between the two closed- 
economy capital-labor ratios. The foreign country (the low-time- 
preference country) has a higher stationary level of utility under 
openness than under autarky (figs. 4 and 5). Two factors contribute to 
this. First, the open-economy capital-labor ratio, while still in the 
inefficient region, is lower than under autarky. The extent of overac- 
cumulation of capital is reduced, and the total amount of resources 
available for consumption is increased. Second, the foreign country is 
a net lender at an interest rate higher than the autarky interest rate. 
The home country (the high-time-preference country) can have 
either a lower (fig. 4) or a higher (fig. 5) stationary level of utility 
under openness. The degree of overaccumulation is increased by 
international integration. Against that, the home country is a net 
borrower at an interest rate below the autarky interest rate. 


“If preexisting capital goods can be shifted abroad, capital goods accumulated 
during period ¢ — 1 out of the saving of young members of generation t — 1 can be 
shifted between countries so as to establish a common interest rate, 7;, and marginal 
product of capital f’(k,) = r; for period t. It now makes a difference whether the move 
toward openness in period ¢ was anticipated in period t — 1. If perfect foresight prevails 
and if k#, < kf, it is easily shown that the old generation alive during the transition to 
openness (generation t — 1) will be worse off under openness than under continued 
autarky in the home country, better off in the foreign country. The effect on the 
welfare of the young generation (¢) is ambiguous. It is therefore no longer necessarily 
the case, as with nonshiftable capital, that the high-time-preference home country will 
always veto a move toward openness. 
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Fic. 4.—Long-run welfare comparison: both autarky capital-labor ratios above the 
golden rule. 


Case 2.—The autarky capital-labor ratios of both countries are 
below the golden-rule capital-labor ratio. This case is illustrated in 
figures 6 and 7. The high-time-preference country (the home coun- 
try) has a higher stationary utility level under openness than under 
autarky. Under openness, the capital-labor is higher but remains 
below the golden-rule level. The total amount of resources available 
for consumption is therefore increased. The country also is a net 
borrower at an interest rate lower than the autarky interest rate. The 
low-time-preference country (the foreign country) can have either a 
lower (fig. 6) or a higher (fig. 7) stationary level of utility under 
openness. Its capital-labor ratio is reduced in the efficient region, 
lowering the total amount of consumable resources. Against that, the 
foreign country is a net lender at an interest rate higher than its 
autarky interest rate. 

Case 3.—The autarky capital-labor ratios are on opposite sides of 
the golden rule. Two subcases need to be distinguished. 

Case 3a.—The open-economy capital-labor ratio is above the 
golden-rule capital-labor ratio. In this case the analysis for the low- 
time-preference country (the foreign country) is the same as in case 1: 
Its stationary utility level is unambiguously higher under openness 
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Fic. 5.—Long-run welfare comparison: both autarky capital-labor ratios above the 
golden rule. 
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Fic. 6.—Long-run welfare comparison: both autarky capital-labor ratios below the 
golden rule. 
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Fic. 7.—Long-run welfare comparison: both autarky capital-labor ratios below the 
golden rule. 


than under autarky. The stationary utility level of the high-time- 
preference country (the home country) can be either lower (fig. 8) or 
higher (fig. 9) under openness. There is a stronger presumption 
toward a higher utility level in this case than in case 1, when the 
autarky capital-labor ratio of the high-time-preference country was 
above the golden-rule level. In case 3a the open-economy capital- 
labor ratio at E},, while in the inefficient region, may still correspond 
to a stationary volume of consumable resources above that corre- 
sponding to E}’, which is below the golden rule. If it does, E}, will 
certainly correspond to a higher stationary utility level than E NT be- 
cause in addition the high-time-preference country is a net borrower 
at an interest rate below the autarky interest rate. 

Case 3b.—The open-economy capital-labor ratio is below the 
golden-rule capital-labor ratio. The analysis for the high-time- 
preference country (the home country) is now the same as in case 2: 
Its stationary utility level is unambiguously higher under openness 
than under autarky. The low-time-preference country’s stationary 
utility level can be either higher or lower under openness. No dia- 
grams are provided as they can easily be constructed by analogy with 
figures 8 and 9. 

An important clue to these welfare comparisons is provided by the 
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Fic. 8.—Long-run welfare comparison: the autarky capital-labor ratios straddle the 
golden rule. 


fact that international trade and financial mobility do not expand the 
consumption possibility set of the “average” world consumer under 
openness beyond that available to the individual countries under 
autarky. The OF in figures 3 and 4 is the stationary, decentralized 
consumption possibility locus for each of the countries under autarky 
and for the integrated world economy. The same applies in the short 
run: There are no potential gains in production from specialization. 
The only difference made by openness is that a worldwide capital 
market is substituted for two national capital markets. The equilib- 
rium in this worldwide capital market will be an average of the two 
autarky equilibria in the national capital markets. 


V. Conclusion 


Diamond’s overlapping-generations model is extended to a two- 
country world. A number of results are obtained. The country whose 
residents consume more in the first period of their lives (at a given 
wage rate and interest rate) has a steady-state current account deficit 
if the rate of population growth is positive. The country with the 
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Fic. 9.—Long-run welfare comparison: the autarky capital-labor ratios straddle the 
golden rule. 


higher pure rate of time preference has a steady-state current account 
deficit. The common stationary open-economy capital-labor ratio lies 
between the two autarky capital-labor ratios, with the lower autarky 
capital-labor ratio associated with the country whose residents have 
the higher pure rate of time preference. Outside the steady state, the 
country with the higher pure rate of time preference will not neces- 
sarily be the one to run a current account deficit. If preexisting capital 
is not shiftable between countries, the welfare of the old generation 
alive when the two autarkic countries join in an open economy is 
unaffected by this international economic integration. The welfare of 
the young generation in the country with the high pure rate of time 
preference is lower under openness than under autarky, while in the 
country with the low pure rate of time preference it is higher. 
The comparison of stationary utility levels under autarky and 
openness yields two unambiguous results. If both the autarky and the 
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open-economy capital-labor ratios of the low-time-preference country 
are above the golden-rule level, its stationary utility level will be 
higher under openness. If both the autarky and the open-economy 
capital-labor ratios of the high-time-preference country are below the 
golden-rule level, its stationary utility level will be higher under open- 
ness. If the first of these conditions holds, the move toward openness 
can be said to constitute a potential Pareto improvement for the 
low-time-preference country, because it always gains in the short run. 
Even if the second condition holds, openness cannot be said to be a 
Pareto-superior regime for the high-time-preference country, as it 
always incurs a welfare loss in the short run. 

The open-economy version of the overlapping-generations model 
can be extended in a number of directions. Within the context of the 
one-commodity model, budgetary and financial policy can be studied; 
this includes public sector lending and borrowing, taxation of wage 
and property income, social security, etc. The two-commodity struc- 
ture of the Heckscher-Ohlin model can be grafted onto the 
overlapping-generations framework.'® This allows potential gains 
from trade through specialization in production. If one of the goods 
is a capital good, the distinction between international mobility of 
financial capital—international lending and borrowing—and inter- 
national trade in capital goods becomes important. One could have 
either or both. The two-commodity structure permits the analysis of 
tariffs, quotas, and other trade policies, familiar from the static 
Heckscher-Ohlin model, in addition to the financial and budgetary 
policies that can be studied in the one-commodity overlapping- 
generations model. Finally, uncertainty can be introduced into the 
model—most interestingly into the two-commodity version. Without 
futures markets for commodities, households have to make labor 
supply and saving decisions while young, without knowing the future 
spot prices of commodities with certainty. The scope for further 
developments appears to be considerable. 


References 


Borts, George H. “A Theory of Long-Run International Capital Movements.” 
J.P.E. 72, no. 4 (August 1964): 341-59. 

Buiter, Willem H. “Government Finance in an Overlapping Generations 
Model with Gifts and Bequests.” In Social Secunty versus Private Saving, 
edited by George M. von Furstenberg. Cambridge, Mass.: Ballinger, 1979. 

Carmichael, Jeffrey. “The Role of Government Financial Policy in Economic 
Growth.” Ph.D. thesis, Princeton Univ., 1979. 


16 In a paper by Fried (1980), an overlapping-generations model with two tradable, 
nondurable commodities is analyzed. This model does not contain a variable capital 
stock. 


OVERLAPPING-GENERATIONS MODEL 797 


Cass, David, and Yaari, Menahem E. “A Re-examination of the Pure Con- 
sumption Loans Model.” J.P.E. 74, no. 4 (August 1966): 353-67. 

Diamond, Peter A. “National Debt in a Neoclassical Growth Model.” 4.E.R. 55 
(December 1965): 1126-50. 

Findlay, Ronald. “An ‘Austrian’ Model of International Trade and Interest 
Rate Equalization.” J.P.E. 86, no. 6 (December 1978): 989-1007. 

Fischer, Stanley, and Frenkel, Jacob A. “Investment, the Two-Sector Model 
and Trade in Debt and Capital Goods.” J. Internat. Econ. 2 (August 1972): 
211-33. 

Fried, Joel. “The Intergenerational Distribution of the Gains from Technical 
Change and from International Trade.” Canadian J. Econ. 13 (February 
1980): 65-81. 

Hori, Hajime, and Stein, Jerome L. “International Growth with Free Trade in 
Equities and Goods.” Internat. Econ. Rev. 19 (February 1977): 83-100. 
Ihori, Toshihiro. “The Golden Rule and the Role of Government in a Life 

Cycle Growth Model.” 4.E.R. 68 (June 1978): 389-96. 

Kemp, Murray C. “Foreign Investment and the National Advantage.” Econ. 
Rec. 38 (March 1962): 56-62. 

MacDougall, G. D. A. “The Benefits and Costs of Private Investment from 
Abroad: A Theoretical Approach.” Econ. Rec. 36 (March 1960): 13-35. 
Neher, Philip A. “International Capital Movements along Balanced Growth 

Paths.” Econ. Rec. 46 (September 1970): 393-401. 

Onitsuka, Yusuke. “International Capital Movements and the Patterns of 
Economic Growth.” 4.E.R. 64 (March 1974): 24-36. 

Ruffin, Roy J. “Growth and the Long-Run Theory of International Capital 
Movements.” 4.E.R. 69 (December 1979): 832-42. 

Samuelson, Paul A. “An Exact Consumption Loan Model of Interest with or 
without the Social Contrivance of Money.” J.P.E. 66, no. 6 (December 
1958): 467-82. 

Stiglitz, Joseph E. “Factor Price Equalization in a Dynamic Economy.” J.P.E. 
78, no. 3 (May/June 1970): 456-88. 


Mobility Costs, Frictional Unemployment, and 
Efficiency 
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With imperfect job information flows, it is plausible that the distri- 
bution of job offerings becomes more attractive when there are more 
vacancies and more unemployed. With word-of-mouth communica- 
tion, this condition is derived. Given this condition, steady-state 
equilibrium is not efficient, with welfare increased by the introduc- 
tion of unemployment compensation even though all agents are risk 
neutral. In this way workers become more selective in the jobs they 
accept. 


Moving and training costs play a significant role in job-taking deci- 
sions.’ Even if workers were equally productive in all jobs, these costs 
would make it sometimes worthwhile for an unemployed worker to 
refuse a job offer while waiting for a more attractive offer. The rate at 
which workers are offered jobs with different moving costs depends 
on the decisions of other workers as to which jobs to refuse. This 
externality implies that equilibrium will not generally be efficient. 
With the plausible assumption that job offerings become more attrac- 
tive on average when the number of available jobs increases, efficiency 
increases when workers are induced to pass up jobs with relatively 
high moving costs (by unemployment compensation, e.g.). 

These results are derived in a model of steady-state search equilib- 
rium similar to one that has been used elsewhere (see Diamond and 
Maskin 1979 and in press; Diamond 19808, 1980c; Mortensen, in press). 


Comments from Joel Yellin and a referee, research assistance by Drew Fudenberg, 
and financial support from NSF are gratefully acknowledged. 
' For a discussion of the role of employment in mobility decisions, see Bartel (1979). 
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Earlier analyses have focused on decisions which affect the rate at 
which workers receive job offers, taking as given the distribution of 
the quality of job offers. Here the arrival rate of job offers is taken as 
given, with the quality distribution endogenous. It is assumed that 
there is a fixed-coefficients technology. Workers and firms are taken 
to be risk neutral and to face a common exogenous interest rate. Both 
job termination and the arrival of new job offers are modeled as 
Poisson processes with fixed parameters. It is assumed that the wage 
and sharing of moving costs are negotiated,” with workers and firms 
equally good bargainers. The steady-state unemployment level is then 
determined by the job-taking decisions of workers. More stringent 
standards for job taking raise the vacancy rate, which in turn im- 
proves the distribution of job offers. Thus there can be multiple 
steady-state equilibria. From any steady-state equilibrium, inducing a 
permanent further decrease in the moving costs which workers are 
willing to bear raises the present discounted value of output in the 
economy. The optimal unemployment compensation benefit is de- 
rived in Section VI. Presentation of these results takes as given the 
relationship between the unemployment rate and the quality of job 
offers. In Section VII, a simple model of job-information flows is 
presented to derive an example of the way higher job availability 
might improve average job-offer quality. An example with an expo- 
nential distribution of moving costs is examined in detail in Sections 
VIII and IX. The example results in optimal benefits in the 
neighborhood of 60 percent of the wage. 


I. Employment 


Assume that all jobs are the same, with a fixed-coefficients technology 
that results in a flow of output, y. All jobs are subject to a risk of 
termination at the constant breakup rate b. This represents exogen- 
ous factors such as a transfer of the worker or the job opportunity to a 
different location. We ignore the possibility of the worker and the job 
moving together. We also ignore endogenous reasons for job termi- 
nation, such as quitting for a better job or laying off workers to hire 
different workers. 

At initial employment, the wage is negotiated, as is the sharing of 
setup costs. For simplicity we shall assume complete symmetry be- 
tween workers and jobs (including equal numbers) implying that the 
net gain from commencing production is shared equally between 


2 Negotiation is assumed to be instantaneous, ruling out one route by which the 
presence of other workers affects a job seeker—that of waiting while a decision is made 
about employing a different worker. 
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worker and employer. In the absence of labor disutility, capital user 
cost, and unemployment compensation, this implies equal sharing of 
the flow of output and so a wage equal to y/2. Workers are assumed to 
be risk neutral and to face a constant interest rate r. With these 
assumptions, it is appropriate to focus attention on the expected 
present discounted value of earnings less moving costs. Denote by Wy 
and W, the expected discounted value of earnings less moving costs 
for employed and unemployed workers, respectively. With complete 
symmetry assumptions about both finding and losing jobs, with an 
infinite expected life, and with analysis restricted to the steady state, 
W;, and Wy, will not vary over time or workers. 

For an employed worker, the rate of interest times expected earn- 
ings equals the wage less the expected capital loss from job termina- 
tion:? 


The next step in the analysis is to consider job taking and so the 
determination of Wy. 


II. Job Taking 


Assume that unemployed workers learn about job opportunities with 
an exogenous’ arrival rate, a. We assume no search costs and no 
decisions of an individual which affect this arrival rate. While the 
productivity of all jobs is taken to be the same for all workers, jobs 
differ across workers in the setup costs before production begins.® 
These costs reflect moving costs when workers must relocate to take 
new jobs and specific training costs before production can begin. The 
pattern of a fixed cost followed by constant output can be viewed as an 
approximation to the increased output that comes with on-the-job 
learning. We ignore the variation among jobs in commuting costs 
since these represent a variation in flow benefits rather than setup 
costs. 

Denote by G(c,w) the distribution of setup costs associated with jobs 
an individual learns about when the unemployment rate is u. A higher 
unemployment rate (and higher vacancy rate) is assumed to improve 
the distribution of moving costs in the sense that G,,(c,u) > 0 where 


* Considering a worker as an asset, eq. (1) is stated as the rate of return times the 
value of an asset equals the flow return plus expected capital gain. It can be derived 
alternatively from explicit calculation of the present value of expected returns as in 
Diamond (19808) or as the limit of a discrete time process: Wy = (1 + rAt)"'[yAt/2 + 
bAtW, + (1 — bAt)We). 

* Below, we will allow a to vary with the unemployment rate. 

* For a detailed analysis of individual choice with setup costs, see Loikkanen and 
Pursiheimo (1979). 
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G(c,u) is positive and less than one.® In Sections VII and VIII we 
derive this assumption from specific models of information flows. We 
take the distribution to be constant over time for a given worker. With 
symmetry between workers and jobs, setup costs are assumed to be 
equally divided between worker and job. If an unemployed worker 
accepts any job with a setup cost less than c*, then, in the absence of 
unemployment benefits, we can write the expected discounted value 
of net earnings implicitly as 


Wy = al We a Waa cl2jdGe.u). (2) 


That is, the interest rate times the expected value of earnings equals 
the expected gain from job taking, less setup costs.” 

The choice problem for the individual worker is the selection of c* 
to maximize W,. Naturally, this involves accepting any job for which 
the setup cost is less than the expected gain from job taking:* 


c* = 2(Wz — Wy) = (» + af, cac\e Se Ot AG (C*-1) |, (3) 


where (1) and (2) have been solved to give the implicit equation for c*. 
Workers are more willing to bear setup costs when output is greater, 
job finding is more difficult, the interest rate is lower, expected job 
duration (5~') is longer, or the unemployment rate is lower. That is, 
from implicit differentiation of (3) we have dc*/dy > 0, dc*/da < 0, 
Oc*/Or < 0, 0c*/0b < 0, and Oc*/du < 0. 


III. Equilibrium 


In steady-state equilibrium, the aggregate rate of job finding must 
equal the rate of job losing. Denoting the unemployment rate by u, 
this gives us 


b(1 — uw) = auG(c*,u). (4) 


That is, the job breakup rate times the proportion employed equals 
the job acceptance rate times the proportion unemployed. Since G,, is 
assumed positive, we have c* decreasing with u. One would not gener- 
ally expect to find the rate, a, at which workers learn of potential jobs 


®° To interpret this assumption one needs to compare alternative economies with 
different steady-state employment and vacancy rates. It is not appropriate to consider a 
single economy over a business cycle, since in that case a rise in unemployment is 
accompanied by a decline in vacancies rather than a move in the same direction. 

7 As a discrete time process, we would have W, = (1 + rAt)“(aAt{G (c*,u)Wp +[1—- 
G(c*,u)|Wy — [%c/2dG} + (1 — aAt)Wy). 

8 Eq. (3) comes from joint wealth maximization of firm and worker and does not 
depend on the equal sharing rule. 
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to be independent of the unemployment and vacancy rates (which are 
equal by assumption). We would expect a to increase with wu across 
steady states. Since the implications of this relationship have been 
explored elsewhere, we take a to be exogenous here but note the 
implied differences in footnotes. Similar considerations hold for bd. 

Since c* decreases with u in both (3) and (4), we have the possibility 
of multiple steady-state equilibria, that is, multiple solutions to (3) and 
(4). When more jobs are available (higher w), anticipated mobility 
costs are lower (higher G) and individuals are more selective in the 
jobs they take (lower c*). Greater selectivity by workers, in turn, raises 
the unemployment rate. This is shown in figure 1, where equations (3) 
and (4) are drawn. 

We note that an individual will not accept a job with an expected 
return below setup costs. Thus the chosen cutoff c* is less than y/(r + 
b). Even if expected future setup costs are close to zero, the potential 
loss in forgone wages implies that the chosen cutoff c* will never be 
less than y/(r + b + a). If all offered jobs are accepted, the equilibrium 
unemployment rate is b/(a + b). If none is accepted, the unemploy- 
ment rate goes to one. 


IV. Steady-State Output 


All steady-state equilibria lie along the curve given in (4). At any point 
on this curve we can calculate the steady-state output level per person. 
There are 1 — u employed per person giving a gross output flow per 
person of (1 — u)y. Moving costs per person equal the rate of new job 
taking per person of auG times the average cost of moving, ie cdG/G. 
Thus net output, Q, satisfies 


O = (14) — au fi cac. (5) 


Differentiating with respect to u, with c* given as an implicit function 
of u by (4), evaluating at an equilibrium where (3) holds, and using 


b(|-u) = auG 


y Ar+b) 


c* 


c =[y +a | cd6]/[r+b +06] 


y/(r+b+a) © 





O b/(a+b) | u 
Fic. 1 
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integration by parts, we have 


* 


die VA in ier adc 0 if 
as er ey cdG — auc*g ae au | cg dc 





: (6) 
= fees au { G,dc, 
0 


where g = G,. The second term reflects the externality in lower 
moving costs of a higher unemployment rate. The first factor reflects 
the absence of discounting in steady-state comparisons. 


V. Dynamics 


The economy analyzed here cannot move directly from one steady 
state to another. While it is interesting to compare alternative steady 
states which might have occurred, proper policy analysis requires 
consideration of the comparative statics of the actual path of the 
economy. We shall analyze the effects of a policy which controls c* 
directly. That is, we consider an economy where job acceptance be- 
havior is sufficiently closely monitored to make it a government con- 
trol variable. Below we will examine how unemployment compensa- 
tion can be used to induce the same steady-state equilibrium without 
monitoring job acceptance behavior. These two modes of control 
correspond to stylized versions of the German and American econo- 
mies. 

Unemployment grows by job terminations and declines by job ac- 
ceptances: 


u = b(1 — u) — auG(c*,u). (7) 


Starting at a steady-state equilibrium given by (3) and (4) we shall 
calculate the change in the present discounted value of aggregate net 
output, W, from a permanent differential change in c*, with unem- 
ployment given by (7). 

That is, we want to calculate the derivative with respect to c* of 


W =foen{u — u(t)]y — au(t) [cae [G: wilde, (8) 


where u(t) satisfies (7) with an initial condition satisfying (3) and (4). 
Calculating this derivative (see Diamond 1980a), we have 





ow = auc*g 
Oc* Yr 


ytaf dG + au { cg dc 
0 0 aug (9) 
r r+b+aG +auG,/ 
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Using (3) we can write this as® 


(ie Ue OG ea auf Cg dc 





Ow = (8) 2p 


Oc* Yr r+b+aG+auG, 
—(a'ug) (cGy =| eg ule (10) 
0) 
a rr +b aG + auG,) 
mie etree Sate <0 
r r+b+aG+auG, ; 


Thus a government policy to move the economy out of a steady-state 
equilibrium in the direction of higher unemployment raises efficiency 
by improving the average quality of job offers.'° 


VI. Unemployment Compensation 


There are several modes of intervention which will raise the equilib- 
rium unemployment rate and so the efficiency of this economy. By 
increasing the importance of adjustment costs relative to the financial 
gains of employment, the government can induce greater selectivity 
in job acceptance. Thus, taxing output without allowing a deduction 
for adjustment costs or subsidizing unemployment or vacancies will 
have the desired end. Of these we shall analyze unemployment com- 
pensation. In addition to making workers more selective in accepting 
jobs, unemployment compensation raises the wage by one-half the 
unemployment compensation benefit given the negotiation assump- 
tions we have made here. 

From the perspective of the worker and employer, the net gain 
from production is y — B, where B is the unemployment benefit. With 
this net gain shared equally, the firm receives O59 -B )eand the 
worker receives a payment for forgoing unemployment compensa- 
tion of B in addition to half the net gain, 0.5(y — B). Rewriting the two 
value equations (1) and (2), we have!! 


° If a is a function of u, (10) becomes 


OW _ ~—awg Jo (aG, + a'G)de 


ack r r+b+aG+auG, + a'uG | 


‘It is important to remember that we are considering policies that increase both 
vacancies and unemployed together. For an analysis of the gain from aggregate de- 
mand stimulation, see Diamond (1980c). 

1! To derive (11) formally, we need first to reconsider the equal-sharing rule above. 
That was derived from equations for W,; and Wy, similar equations for the values of 
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Wy=B af (We = Wy — c/2)dG(c,u). 


Then, the chosen cutoff cost satisfies 
c*¥ = (Wz — Wy) = (y pyres af cdG Jr +b+aG). (12) 


Increasing the unemployment benefit by $1.00 decreases the cutoff 
level of costs selected at a constant unemployment rate by (r + 6 + 
aG)~* dollars. The induced increase in unemployment magnifies this 
effect. 

Setting equation (9) equal to zero, we can derive the cutoff cost level 
which would be optimal in a steady state. That is, an optimal trajectory 
will, asymptotically, have the cutoff cost level, c*, satisfying” 


y +af cdG +auf cg de 
ee (13) 
r+ b—~aG + auG, 
Solving (12) and (13), we can derive the unemployment compensation 
benefit which holds asymptotically on the efficient trajectory (i.e., 
which supports the asymptotic optimum): 


(y + af cdG \auG = au | cg,dc(r + b + aG) 
0 0 


B*¥ = 
TRO Ge OuG, 


(14) 
Substituting from the first-order condition for c*, B* can be written 
alternatively as'® 


Be= au(*G, “J eg ue | ~ au [ Gile 0, (15) 
0 0 


Passing up a job offer alters the trajectory of the economy and so 
generates a pattern of externalities that varies over time. The present 


filled and vacant jobs, W, and Wy, and an equal sharing of surplus value, W, — Wy = 
W, — W,. With unemployment compensation and equal sharing of setup costs the four 
value equations can be expressed in terms of the wage, w: 


1W, = w — b(Wz — Wo), 

TVW Vere Wige— Wy), 

Wy =B +af (Ws — Wy — c/2)dG, 
rW, =af (Wr — Wy — c/2)dG. 


Combining these with the equal gain rule gives (11). 
2 From (13) and the first line of (6), we not¢ that dQ/du < 0 at u*. 
13 Tf a varies with u, (15) becomes B*¥ = u fo (aG, + a'G)dc. 
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discounted value of the social gain from accepting a job with setup 
cost c can be derived by differentiating W, in (8), with respect to w. 
Evaluated at a steady state this gives 


OW 2 22 oe 16 

ee r+b+aG+auG, io a0) 

In an efficient equilibrium this social gain must equal the private gain 

2(W, — W,) — c. Equating the social and private gains, using (12), we 

have an alternative derivation of the asymptotically optimal unem- 
ployment compensation. 


VII. Discrete Example 


Having examined the implications of the dependence of the distribu- 
tion of adjustment costs of job offers on the unemployment rate, we 
now consider an example of how word-of-mouth communication of 
job availability can generate such dependence.“ In this section we 
consider an example with two locations. Specific training costs are ¢. 
There are no moving costs if the job and worker are in the same 
location. There are moving costs c, if the job and worker are in 
different locations. For the next section we consider an example 
where c is distributed exponentially with a parameter that depends 
linearly on the unemployment rate. 

Assume that employed workers learn of a job opportunity in the 
same location and inform their unemployed friends of its existence. 
Assume that the process of job communication is such that attempted 
communication about any vacancy is a Poisson process with constant 
parameter a. (In practice it is likely that there are more attempted 
communications of new job vacancies than of old ones.) Each poten- 
tial communicator knows n workers who could fill this job. Assume 
that each of the n has a probability u of being unemployed and an 
independent probability p of being in the same location. With proba- 
bility (1 — uw)” all n friends are employed and there is no one to tell of 
the job. If the communicator does have unemployed friends, he only 
tells a friend in the other location if none of his unemployed friends 
are in the same location. That is, he tells the friend to whom the 
information is most valuable. The probability of his telling someone in 
the other location is [(1 — up)" — (1 — w)"]—the probability of no 
unemployed friends with the same location less the probability of no 
unemployed friends. Then | — (1 — up)” is the probability of telling a 


‘4 For previous analyses of word-of-mouth communication, see Boorman (1975) and 
Satterthwaite (1979). 
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friend in the same location. The greater the unemployment rate, the 
greater the probability of telling a friend in the same location. 

With everyone following the same behavior rule, there are two 
candidates for equilibrium: accept only jobs at the same location, or 
accept any job offer. Let u, and u, be the unemployment rates under 
these two behavior rules. Then the equilibria satisfy 


b(1 — u,) = au,[1 — 1 — u,p)") 
b(1 — uz) = aua[1 — (1 — ue)"]. 


(17) 


For some parameter values both of these equilibria will exist. Rather 
than pursue this example in more detail, we turn to a similar example 
with a continuum of locations, which can be considered to be around a 
circle. It is then assumed that the likelihood of knowing an individual 
is exponentially distributed with the distance to his location and that 
communication goes to the unemployed person for whom the moving 
costs are smallest. 


VIII. Exponential Example: Steady-State Properties 


We assume that c has an exponential distribution with coefficient nu. 
This example is chosen to fit the discussion above since the minimum 
of a random sample of size nu from the exponential distribution with 
coefficient 1 is exponential with coefficient nu. Thus we assume that 


GAG Lee ee (18) 

With this distribution, the two equilibrium equations (3) and (4) satisfy 
* — a SS nnuc= 19 

CE (rd 0 a) Narre eae e ) (19) 

DE 76) = ail boa ee"), (20) 


There exists a unique solution to this pair of equations with u between 
b/(a + b) and one and c* between y/(r + b + a) and y/(r + 6). 

In a steady state (whether equilibrium or not), the net output flow 
satisfies, from equation (5), 


OO (yw) y rian FLL = (1 + nue*)| 
ee) 


nu 


- (1 - ot) inf - bles) ' 


n au 


sic haeth)y (21) 
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The asymptotically optimal unemployment compensation satisfies, 
from equation (15), 


Be= (<)u =e MEE (uc ® 1) 


(22) 
- $09 4 (21 --] in [1 - “UL —) . 
From (21) and (22) we note that 
Ov (lies) Viens (23) 


Alternatively this follows from (15) since, with the exponential dis- 
tribution, f uG, = (r cg. From (13) we note that at the steady-state 
optimum the maximal acceptable moving costs satisfy 


» apo (24) 


To find the optimal unemployment rate, we solve (20) and (24) 
simultaneously. 

Some examples are shown in table 1. For these calculations, 
parameters were chosen for a, 6, and r. Next an equilibrium unem- 
ployment rate in the absence of government intervention, u, was 
chosen. This implied a particular value for the product ny, and the 
remaining calculations were done for this value. In addition to 
steady-state comparisons, the last column of table 1 reports the per- 
centage change in the present discounted value of net output along 
the trajectory from the equilibrium steady state to the optimal one 
where the fraction of accepted jobs, G, is held constant at its 
asymptotically optimal value. As detailed in the next section, the 
increase in net output along the trajectory exceeds the difference 
between steady states. 

As a guide to interpreting the table, consider the fourth row. In an 
economy where the unemployed receive 10 job offers a year, the 
expected duration of a job is 5 years, the interest rate is 5 percent, and 
the equilibrium unemployment rate is 4 percent, 3 percent of gross 
output is spent on moving costs, workers are just willing to spend 34 
percent of a year’s output on moving, and workers accept 48 percent 
of jobs they hear about. The asymptotically optimal unemployment 
rate is 4.7 percent, which can be induced by unemployment compen- 
sation equal to 60 percent of the wage. In this equilibrium workers 
accept 41 percent of job offers representing an 18 percent increase in 
the expected duration of unemployment, and steady-state net output 
is higher than in the no-compensation equilibrium by 0.23 percent 
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with a gain of 0.24 percent along the constant G trajectory from the 
initial equilibrium to the asymptotic optimum. 

The examples show a surprisingly consistent pattern. When the 
equilibrium unemployment rate is very close to b/(a + 5b), the 
minimum achievable,’ the optimal unemployment compensation is 
small. As wu rises, B*/w rises very rapidly, reaching the neighborhood 
of 60 percent when u is about one percentage point above the achiev- 
able minimum. The ratio B*/w stays in the neighborhood of 60 
percent for all calculated values, which included values of u as integer 
percentages up to 10 percent. The same pattern arose for all calcu- 
lated values of a, b, and r. 


IX. Continuous Example Dynamics 


The transition from the equilibrium steady state to the optimal one 
shows a larger change in net output than does the steady-state com- 
parison. To see this, let us consider a policy of changing the propor- 
tion of jobs accepted by an unemployed worker. (We shall also con- 
sider the optimal policy below.) If workers are accepting all jobs with 
costs below c*(t), then the proportion of jobs accepted equals 1 — 
e mmc Tet us consider the policy which holds this proportion 
constant over time at the level that occurs in the asymptotic steady 
state which we write as G*. Then the economy follows the differential 
equations 


w(t) = b[1 — u(t)] — au(t)G* 


(25) 
(tut) 
C(t) u(t) 


We assume an initial condition at time t) of the steady-state equilib- 
rium. The immediate effect of the change in policy is to decrease 
aggregate setup costs without changing gross output. Over time the 
unemployment rate rises, decreasing gross output but keeping aggre- 
gate setup costs constant. This pattern is shown in figure 2. To 
calculate W along this trajectory, begin by considering aggregate 
setup costs. These equal 


c*(t) 
au(t) | enue ede = —{l =e ne nu (tct) 4 1). (26) 


Since u(t)c*(t) is constant, we have 
QO) = Q* + ylu* = ua). (27) 


® When a = 10 and b = 0.2, bi(a + b) = .019608. 
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Fre. 2 


Solving (25) for u(t), we have 
w(t) = uF + (uw — u*)e OU, (28) 


Thus the present discounted value of output satisfies 


Pecos 2 = 
(r ae uF ) 
Since the optimal unemployment rate exceeds the equilibrium rate, 
the value of output along this path exceeds its value at the optimal 
steady state. The excess of W’ over equilibrium output, W, is shown in 
table 1. The addition to the value of unemployment compensation 
from analyzing the dynamic path is small but noticeable for the 
examples calculated. 
In addition to considering this path, which was chosen for its ease of 
analysis, it is interesting to analyze the optimal path assuming the 
government could control c*(t). The choice problem is 


(29) 





max J em((1 — w(oly ~ £0 ~ morse + nu(er(O1}) a 
(30) 
s.t. w(t) = [1 — u(t)]b — au(t)[1 — err). 
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The first-order condition for ¢* is 
Ch) = Dt) Visa (lence 2d. (31) 


The phase diagram for the optimal path is shown in figure 3. Along 
the optimal path, c*(¢) is falling while w(t) is rising. Their product, and 
so the proportion of job offers accepted, is rising. In terms of figure 2 
the optimal path of net output converges to Q* more rapidly than the 
path with wc* constant, starting from a higher initial net output level. 

Baily (1977) and Flemming (1978) have analyzed optimal unem- 
ployment compensation assuming risk-averse workers and no exter- 
nalities in the labor allocation process. This paper takes the opposite 
tack of assuming risk-neutral workers and externalities. It seems likely 
that workers are risk averse and externalities are present, making a 
much stronger case for unemployment compensation. 
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International Transmission of a Real Shock 
under Flexible Exchange Rates: A Comment 


Betty C. Daniel 


Tulane University 


In a recent article in this journal, Hamada and Sakurai (1978) claim to 
have demonstrated that a recession in a foreign country, generated 
either by a reduction in monetary growth or by a decline in produc- 
tivity, is transmitted to the domestic country as a temporary 
stagflation. They emphasize the wage-price spiral as the transmission 
mechanism which operates under fixed and flexible exchange rates. 
In particular, any fall in foreign output causes an increase in the 
relative price of foreign goods. This puts upward pressure on home 
wages through a Phillips curve, which includes expected inflation of 
consumer prices; employment and output fall while inflation in- 
creases. The article deserves acclaim for focusing on the neglected 
“wage-price spiral” transmission mechanism; however, it is misleading 
in its assertion that a reduction in output abroad, whatever its cause, 
has temporary effects at home. 

The purpose of this note is to demonstrate that the wage-price 
spiral transmission mechanism, following a decline in foreign pro- 
ductivity, generates a permanent fall in domestic output as well as 
temporary stagflation. The productivity decline, which permanently 
lowers foreign output, permanently deteriorates the domestic terms of 
trade and permanently reduces domestic output by reducing the pur- 
chasing power of labor’s wage. The deterioration in the terms of trade 
leaves the real producer wage (wage divided by the price of the 
domestic output) unaffected. However, the higher price of imports 


I wish to thank my students Rebecca Morton and Vincent Salvo for drawing attention 
to the inconsistency implied by temporary domestic effects of a permanent foreign real 
shock. I also wish to thank Koichi Hamada for excellent comments on an earlier draft. 
Partial financial support from the Murphy Institute of Political Economy is gratefully 
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reduces labor’s real wage (wage divided by the consumer price index), 
creating an excess demand for labor. The nominal wage is bid up, and 
employment and output fall.’ 

The source of the authors’ result is an incompletely specified 
model. Their model is adequate for monetary shocks (their primary 
interest) but not for real shocks. Extension of their model to meaning- 
fully incorporate real shocks requires a respecification of their Phillips 
curve. 

The authors specify the domestic and foreign Phillips curves by 
letting the rate of wage inflation in each country (w, w*, where an 
asterisk refers to a foreign variable) equal expected inflation in the 
consumer price index (7, 7*) plus a function () of output produced 
in each country (X,,X#, where subscripts 1 and 2 refer to the com- 
modities produced in the domestic and foreign countries, respec- 
tively). They are expressed as 


w = W(X) + 7, (1-1) 
w* = p(X) + a*. (1-2) 


(Equation numbers correspond to those of the authors unless pre- 
ceded by the letter A. The latter are alternative equations proposed 
by myself.) 

The long-run requirement that w = 7 implies that there is only one 
value of X, consistent with long-run equilibrium. The analogous rela- 
tion holds for the foreign country. This implication is clearly incon- 
sistent with allowing real shocks to have permanent long-run output 
effects. In particular, it is inconsistent with the author’s claim that a 
productivity decline in the foreign country permanently reduces X ¥. 
This problem is easily solved by respecifying the Phillips curve to 
allow wage inflation to depend on the deviation of output from 
long-run equilibrium as well as on expected inflation according to 


w= W(X, — X,) + a, (Al-1) 
w* = W(X¥ — X¥) + 7*, (A1-2) 


where a bar denotes a long-run equilibrium. 

It is now necessary to specify the determinants of long-run output 
implied by the authors’ model. Steady state is characterized by 
equality in the rates of growth of money (6), prices (p), and wages (w). 
Output is determined by the authors’ production functions, 


SNe, C= Bp | (3-1) 
AF] NE 0 = BF <1: (3-2) 


" This effect is discussed in Salop (1974), Daniel (1977), and Leiderman (1979). 
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where X denotes output, N denotes employment, and B is the em- 
ployment elasticity of output. 

Labor demand is determined to equate the nominal wage (w) with 
the value of marginal product according to 


it Pi pN oe, (4-1) 
te psp No (4-2) 


where p denotes price. 

Prices are determined by the assumption that expenditure on each 
commodity is a constant fraction of total expenditure. This assump- 
tion together with the authors’ assumption that expenditure equals 
income yields the authors’ equations, 


piX1 = vM, (13-1) 
pExt = vM*. (13-2) 


To close the model for long-run output, a specification of the 
long-run supply of labor is necessary. I assume labor supplied is a 
positive function of the real wage according to 


N = (w/p)’, (a) 
N* = (w*/p*)™, (b) 


where 6 and 6* > 0 are the long-run real wage elasticities of the labor 
supply, and p and p* are the consumer price indices in the domestic 
and foreign countries and are defined by the authors as p = p{p2* 
and p* = p#*p¥'*. This specification is in keeping with the short-run 
Phillips curve whereby a slower growth rate of nominal wages, relative 
to the price level, reduces the real wage and reduces output. By 
allowing labor supply to care about the real wage in the long run as 
well as in the short run, the possibility that a disturbance which alters 
the real wage can permanently alter output is admitted. This is pref- 
erable to assuming away long-run effects by fixing long-run labor 
supply while allowing short-run labor supply to respond. 
Equations (3-1), (3-2), (4-1), (4-2), (13-1), (13-2), (a), and (b) can be 
solved to yield the reduced-form solution for long-run output: 


= + + 

X, = ¥(B, B*), (c) 

= + + 

AG == VRB, (B*). (d) 
It is important to notice that a fall in B* permanently reduces domes- 


tic output because it deteriorates the home country’s terms of trade 
thereby reducing labor’s real wage.” 


2 This can be proved by totally differentiating the log linear version of the model 
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It is now possible to meaningfully analyze a shock to foreign pro- 
ductivity. The respecified Phillips curve together with explicit allow- 
ance for changes in productivity require rewriting the authors’ 
equations (20) and (22) as 





X= plo rx aes (1 + as \p 2 | (A20) 


Ry = peor — yxe — RH + (1+ Ebr — wt, (422) 


where * denotes a proportional rate of change. The long-run re- 
quirement that 0 — 7 = Xie B = 0 (6* — m* = X, = B* = O) implies 
that X, = X, (X, = X,). Anything which changes the long-run level of 
output requires an eventual corresponding change in X, (X2). 

The model is completed with the assumption of adaptive inflation- 
ary expectations which yields the authors’ equations (21) and (23). 


They are unchanged by the foregoing modifications: 
a = {0 — [aX, + (1 — a) X¥] — zr}, (21) 
a* = y{O* — [aX, + (1 — a) X¥] — w*}. (23) 


Figure | depicts the relationships between 7 and X, necessary for oe 
= 0 and 7 = 0 for given 8, B, X,, X¥, and 6 and the corresponding 
relationships in the foreign country. Equations (A20) and (A22) imply 
that for X,; = 0, inflation must be negatively correlated with output. 
The same type of relation exists in the foreign country. The 7 = 0 
relation is determined by substituting equation (A20) into (21) and 
analogously for 7* = 0. Dynamics are described by arrows following 
the authors. 

Now consider a fall in foreign productivity. This can be represented 


composed of eqq. (4-1), (4-2), (13-1), (13-2), (a), and (b) where the exchange rate is 


normalized at unity. 


dp, _ —(s(1—a)+(m — BU + n* InN*)p, <0 
x 5 





dp* 

dp, _ BU Sa) ty mNO pe 

dp* A ; 
dN* _ {(1 ~ @)[(B/6*) + 7 In N*] + (1/8*)(m — B)IN* S 9 
dp* A ; 
iN. (lio) (i ayn NN 

ap at ame ah tae tan =), 


where A = a8*(n — B) + (1 — a)(n* — B*)B + (ym — B)(n* — B*), n= (1 + 8)/8, and 7* = 
(1 + 8*)/8*. With B = B* initially, as the authors assume, it is obvious that a reduction in 
B* deteriorates the domestic terms of trade and reduces foreign employment relatively 
more than domestic employment. Foreign output also falls relatively more from eqq. 


(3-1) and (3-2). 
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Domestic Country Foreign Country 
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by a shock to £*. In equation (A22), B* falls temporarily and X# falls 
permanently. Both effects shift X¥ = 0 left in figure 1, and since the 
fall in X¥ is permanent, it remains left of its original position. In 
equation (23), the fall in 8* stimulates inflationary expectations by 
reducing X, (from [A20] and [c]) and by reducing X ¥ (from [A22] and 
[d]); a* = 0 shifts up to the left permanently. Under the assumption 
that B* # O for a very short period of time, the foreign adjustment 
path can be drawn as by the authors in figure 1. 

The domestic adjustment path and final equilibrium are not as 
described by the authors. The fall in X¥, which occurs due to the 
productivity shock, causes a permanent increase in its relative price 
(see n. 2 above). This reduces home labor’s real wage and perma- 
nently reduces X,. Therefore, X, = 0 also shifts left. It shifts less than 
Xt-= 0. The permanent increase in the relative price of foreign 
output implies that X, falls less than X ¥; + = O also shifts left due to the 
decline in X, and X¥ caused by the fall in foreign productivity. The 
adjustment path and long-run equilibrium are similar to those 
abroad. The stagflation is temporary in both countries, but the fall in 
output is permanent.® 

In conclusion, this note confirms that the wage-price spiral does 
cause a fall in foreign output, whatever its cause, to be transmitted as 
stagflation in the short run, as the authors claim. However, it should 
be noted that the wage-price spiral also causes a permanent foreign 


3 Hamada has noted that the fall in output could be permanent under the assumption 
that labor supply is independent of the real wage in the long run. This assumption 
seems less appropriate in a paper about the wage-price spiral transmission mechanism 
in which the short-run dynamics depend crucially on the effect of changes in the real 
wage on labor supply. 
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productivity deterioration to be transmitted as a long-run decline in 
domestic output. The permanent terms of trade deterioration, im- 
plied by the productivity shock, reduces the purchasing power of the 
domestic workers’ wage leading to less employment and output. 
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Market Constraints as a Rationale for the 
Friedman-Savage Utility Function 


Elie Appelbaum 
University of Western Ontario 
Eliakim Katz 

Bar-Ilan University 


I. Introduction 


Since the classic paper by Friedman and Savage (1948), it has gener- 
ally been accepted that the observed fact that individuals or firms? 
participate in unfair lotteries and other forms of unfair risk taking” 
may be explained by a section in the individual’s utility function in 
which the individual shows risk preference rather than risk aversion. 
In their model, Friedman and Savage specify a utility function which 
is, in turn, concave, convex, and concave, thus allowing for simultane- 
ous purchase of insurance (risk aversion) and participation in lotteries 
(risk preference). 

However, while their specified utility function is indeed capable of 
explaining observed behavior, it does seem to be rather unsatisfactory 
in that it is an ad hoc specification. It is the purpose of this paper to 
suggest a set of circumstances which give rise to a Friedman- 
Savage-type utility function. 

In particular, it is shown that when certain capital market imper- 
fections exist the utility function defined over intermediate wealth 
should be distinguished from and may have different properties than 
the one defined on final wealth. Then, even if we accept the common 
assumption made in the literature that individuals are risk averse, that 
is, that their utility function is concave over final wealth, it is still 
possible that they participate in unfair gambling (and, of course, may 
also purchase insurance). 


An earlier version of this paper was presented at the European Meeting of the 
Econometric Society, Athens, September 1979. We wish to thank the editor and an 
anonymous referee for their helpful comments. 

1 The word “individual” is, throughout this paper, used to denote economic units, 
including both individuals and firms. 

> The term “unfair lottery” in the paper encompasses all investment opportunities 
offering actuarially unfair risks. 
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The motivation behind our endeavor to provide an institutional 
explanation for gambling is a reluctance (shown, e.g., by Stigler and 
Becker [1977]) to use non-well-behaved utility functions as expla- 
nations for economic phenomena without some a priori reasons for 
assuming that the utility function is of an unusual shape. Clearly all 
phenomena in economics can be technically explained by recourse to 
tastes. Thus, for example, individuals will necessarily show risk pref- 
erence over a section of their utility function which is assumed con- 
vex. However, is there any a priori reason to assume that there may be 
such a section? It is a positive answer to this question which this paper 
attempts to provide. 

In Section II we discuss certain capital market imperfections and 
their implications about the nature of rates of return on investment. 
In Section III we show how the Friedman-Savage utility function may 
emerge when utility is defined on intermediate wealth and there exist 
capital market imperfections. The possibility of gambling with risk 
aversion and of simultaneous gambling and insurance therefore im- 
mediately arises. 


II. Rates of Return and Capital Market Imperfections 


A number of recent studies discuss the effects of uncertainty, im- 
perfect information, and various transaction costs on capital markets 
and show that the result may be that capital markets are characterized 
by certain imperfections. These imperfections are usually in terms of 
the nonexistence of certain markets and the fact that there may not be 
free and equal access to other markets. Uncertainty and the possibility 
of costly default may lead lenders to introduce collateral requirements 
or take default costs into account in their loan rates. Consequently, it 
can be said that one has to have certain assets (providing collateral 
services) in order to have easier or cheaper access to capital markets. 

Barro (1976) and Benjamin (1978) derive these conclusions ex- 
plicitly and show that market imperfections will lead to loan rates 
being functions of loan sizes and available collateral. In particular, 
they derive a loan-supply function which is constant for some initial 
range and then becomes an increasing and convex function of loan 
sizes. 

In the same vein, Jaffee and Modigliani (1969) show that, as a result 
of uncertainty and imperfect and costly information, capital markets 
may be characterized by credit rationing. They show that beyond a 
certain point loan rates will generally depend on loan sizes, with a 
possible upper bound on loan sizes. Furthermore, they provide 
empirical evidence supporting these types of imperfections. 

Empirical evidence indicating capital market imperfections is also 
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given by Eckstein (1961) and Nerlove (1968), who find capital markets 
to be characterized by differential rates of return on given invest- 
ments. 

The important implication of the capital market imperfections is 
that individuals face a variety of capital market constraints and, there- 
fore, do not have free and equal access to the market. Consequently, 
capital provides additional services by either weakening the accessi- 
bility constraints or by reducing the cost of the acquisition of capital. 
By providing collateral services, capital, therefore, increases the set of 
“feasible activities” or reduces their cost, so that its “full rate of 
return” should take this additional role into account. 

A consequence of these capital market imperfections is that for 
some levels of asset holdings the rates of return on assets may depend 
on the levels of the asset holdings, even from an individual’s (rather 
than an aggregate) point of view. The range within which this is likely 
to happen is when wealth levels are low, but not below some minimum 
level. When an individual’s wealth is below some minimum level the 
capital market constraints may be so effective that except for the 
possibility of obtaining small loans the market is in effect inaccessible. 
Since small loans can usually be obtained at constant loan rates (as is 
shown in Jaffee and Modigliani [1969], Barro [1976], and Benjamin 
[1978]), the additional role of asset holding is ineffective within this 
initial range, and consequently the rates of return will not depend on 
asset holdings. However, as an individual’s asset holdings increase 
above some minimum level, his higher level of wealth will provide him 
with the additional “collateral services,” and rates of return within this 
range will depend on the levels of wealth. 

Since rates of return are usually not known with certainty, the 
dependence of rates of return on asset holdings is in a probabilistic 
sense. In other words, the rates-of-return probability distribution 
function changes with asset holdings, that is, the distribution is condi- 
tional on asset holdings.* 

In addition to the above considerations, rates of return may be 
increasing functions of wealth (over some initial range) for other 
reasons as well. First, a large number of fixed costs may be incurred in 
effecting profitable investment. Information costs incurred in locat- 
ing high-return investments may be considerable, and a large element 
of these costs may be relatively fixed. Transaction costs, where again 
the large part is fixed, are also likely to comprise a high percentage of 
small investments. In the presence of these costs the rate of return will 
be an increasing function of the investment. Second, there may exist 


$ Thus, e.g., an increase in asset holdings will over some range make the probability 
distribution more stochastically dominant. 
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significant indivisibilities which (especially in view of the capital mar- 
ket constraints) will imply increasing returns to scale. 

In view of these considerations, we conclude that individuals, espe- 
cially those with small or moderate levels of wealth, may very often 
face various capital market constraints which lead to rates of return 
being functions of wealth. Furthermore, the relationship between 
wealth and the rates of return is such that rates of return are (at least 
over some range) increasing functions of wealth. 


III. Gambling with Risk Aversion 


Having discussed some of the capital market imperfections and the 
constraints they impose on individuals, we now consider their effects 
on the individual’s attitudes toward risk taking. 

In order to focus on the gambling problem and to separate it from 
the investment problem, we assume that the rate of return is nonran- 
dom. This assumption is not crucial and does not change the results. 
For the sake of simplicity, we also assume that there is only one asset. 
A discussion of the case with many assets and random rates of return 
can be found in Appelbaum and Katz (1979), where the same results 
are derived. 

In line with the discussion in the previous section, it will be assumed 
that the individual faces a rate of return R on his investment, A, such 
that 


Ro = 0 0 <A S A* 
R = R(A) R' >0 forAt SA SA** (1) 
R' = 0 for A > A**, 


where R’ is the partial derivative of R. In other words, the rate of 
return is constant over some initial range, increasing over some sub- 
sequent range,’ and then constant again. 


An individual investing an amount 4 will end up with final wealth 
W, such that 


W = A{1 + R(A)]. (2) 


Clearly, convexity of R(A) over some range is sufficient, but not 
necessary, for the convexity of W(A) over the same range; both convex 
and concave rate-of-return functions may lead to a convex W(A).5 


* This type of return function is discussed also in Blinder (1974) and Appelbaum and 
Harris (1978). 

° It does, however, seem reasonable that at least over some initial range (when R is not 
constant) R” > 0, i.e., the function is convex. The convexity of R(A) at the initial range 
would, e.g., follow from the convexity of the loan-supply function derived by Jaffee and 
Modigliani (1969), Barro (1976), and Benjamin (1978). 
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Let us now consider the individual’s decision problem. Following 
the literature,® we assume the individual has a utility function defined 
on final wealth, U(W), and that 


ae 0 OU |): (3) 


that is, the individual is risk averse. 
Now, consider the individual’s utility function when it is defined 
over A—his intermediate wealth. This is given by 


V(A) = U{A[1 + R(A)]} (4) 

and 
Vay — UF {1a R + AR), (5) 
Ve oh An) aU (2h! + AR’). (6) 


Therefore, in accordance with our above discussion, V' > 0 for all 
values of A. For A < A* and A > A**, V'" is clearly negative. Within the 
range A* <A = A** the second term on the right-hand side of (6) may 
be positive and, hence, V” may be positive. If this occurs there will be a 
section of V(A) that will be convex and, hence, V will show the 
Friedman-Savage shape of being in turn concave, convex, and con- 
cave. 

The result may also be derived diagrammatically, as is shown in 
figure 1. The curve V(4) drawn in the first quadrant is clearly a 
Friedman-Savage type of utility function which allows both gambling 
and insurance in intermediate wealth to take place simultaneously. 

Even if V(A) is concave within the range A* < A < A**, we can still 
get a convex region, as in the Friedman-Savage utility function, since 
V(A) will, in general, have a kink at 4*. Thus, although in this case 
V(A) is made of concave segments only, it will nevertheless not be 
globally concave. This case is shown in figure 2, where again it is 
possible to observe gambling and insurance simultaneously. 

Hence, we have provided an explanation of the shape of the 
Friedman-Savage utility function without suggesting that an indi- 
vidual is anything but everywhere risk averse. Rather than focus on 
the tastes of the individual, we have focused on his institutional 
constraints.’ 


6 See, e.g., Arrow 1970. 

7 An alternative way of obtaining these results is to employ the state preference 
approach and consider the effects of capital market imperfections on the set of accept- 
able gambles. It can be easily shown (see Appelbaum and Katz 1979) that the isoutility 
curves defined over two random states are not necessarily convex, and thus both 
gambling and insurance buying may occur simultaneously. 
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IV. Conclusion 


This paper provides a possible rationale for the Friedman-Savage 
utility function. It suggests that the existence of market imperfections, 
in particular in the capital markets, may impose various constraints on 
individuals and thus affect their behavior. The existence of these 
constraints could lead individuals to participate in unfair gambling, 
since this may reduce the implicit costs of the constraints. 
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Our explanation of gambling focuses on the constraints facing an 
individual rather than on his preferences, and consequently it does 
not require a modification of the standard assumption that individu- 
als are risk averse. Of course, it may be that individuals show risk 
preference over a section of their utility function. Such a section is 
not, however, necessary in order to derive a Friedman-Savage type of 
utility function. 
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Foreign Trade Regimes and Economic Development: Anatomy and Consequences of 
Exchange Control Regimes. By JAGDISH N. BHAGWATI. 

Cambridge, Mass.: Ballinger Publishing Co. (for National Bureau of Eco- 
nomic Research), 1978. Pp. xix+232. $16.50. 


This book is one of two volumes synthesizing the results of the NBER’s 
project on Foreign Trade Regimes and Economic Development. The country 
studies covered Turkey, Ghana, Israel, Egypt, the Philippines, India, South 
Korea, Chile, and Colombia. Taken together with the country studies under- 
taken as part of the OECD’s Industry and Trade project, whose results were 
summarized in the well-known volume by Little, Scitovsky, and Scott (1970), 
development economists now have a wealth of analytical and empirical mate- 
rial from which to judge the past trade policies of developing countries, as 
well as to make firmer policy pronouncements about future trade policy. 

Bhagwati’s synthesis volume covers two of the three issues with which the 
project was concerned, namely, “the characterization of the anatomy of ex- 
change control regimes, including the interaction between quantitative and 
price measures” (p. 2), and the relationship between different types of 
exchange-control regimes and economic efficiency both in its static and 
dynamic aspects. In its third dimension, the project attempted to discover if 
there was “something systematic about the forces that governed the likelihood 
of success in moving from a restrictive exchange control regime to a 
liberalized regime” (p. 3). This is the subject matter of the companion volume 
by Anne Krueger (which is not reviewed here). 

Within the compass that Bhagwati has set for himself, he has succeeded in 
providing a rigorous analysis and scrupulously fair summary of the evidence 
that he and others think is relevant to the anatomy and consequences of 
exchange-control regimes. 

The starting point for Bhagwati’s taxonomy of alternative exchange- 
control regimes is the limitations of the traditional Marshallian proposition on 
the equivalence of tariffs and quotas when its implicit assumption that the 
supply and demand schedules for foreign exchange are given breaks down. 
The actual ways in which imports (and/or exports) are regulated therefore 
become important in determining the relative domestic prices of traded 
goods and, hence, the welfare effects of the trade regime. As Bhagwati’s 
subsequent taxonomy of the regulation mechanisms observed in the country 
studies shows, they are a bewildering variety, and if nothing else his taxonomy 
should help the practitioner attempting to analyze a particular LDC’s trade 
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regime to organize his or her thoughts and, as such, is a valuable addition to 
the literature. The rest (and major part) of the book summarizes the evidence 
from the project (and elsewhere) on the consequences of exchange-control 
regimes. 

Aside from documenting the reasons for and evidence on the illegal trans- 
actions that different types of exchange-control regimes have entailed, the 
bulk of Bhagwati’s book (pp. 82-218) is concerned with the static and 
dynamic efficiency effects of exchange-control regimes. The project used 
estimates of domestic resource costs (DRCs) as indicators of static efficiency. 
Bhagwati provides a lucid discussion (pp. 88-90) of the statistical and ana- 
lytical pitfalls in the use of DRCs as opposed to “shadow prices” as measures 
of the misallocation of given resources. But despite these shortcomings, he 
rightly argues that “there is no easy way out of the conclusion that the pattern 
of investment was less than optimal” (p. 91) under the project Phase Two (-e., 
QR [quantitative restrictions] based) regimes. Moreover, estimates of social 
rates of return based on “shadow prices” for at least India (see Lal 1980) show 
that these conclusions are unlikely to be overturned by the use of more 
appropriate indicators of allocative efficiency. 

The evidence for the NBER country studies thus provides further support 
for the major conclusion of the earlier OECD study on the misallocation of 
resources resulting from import-substituting QR regimes. Thus Bhagwati 
writes, 

the weight of the evidence in the countries in the Project then is in 
support of the hypothesis that exchange control regimes by 
cheapening the cost of intermediates and capital goods influence and 
distort both the choice of techniques within activities and the relative 
composition of outputs and investments among activities. . . . while 
part of the evidence is inferential in deducing effects from the 
observed institutional features of the exchange control regimes of 
the Phase II variety and only part is direct, the overwhelming bulk of 
it is consistent with the view that the static efficiency effects of such 
regimes have been adverse in practice. [Pp. 117, 121] 


The evidence presented on the dynamic effects of alternative trade regimes is 
less conclusive, though Bhagwati states that the major difference between the 
NBER and OECD projects was “in analyzing in far greater depth the growth 
effects as well as the conditions determining a successful transition from 
restrictive to liberalized regimes” (p. 6). The latter aspect is not discussed in 
this review, as it is the subject matter of Krueger’s volume. 

There is a long chapter reviewing the possible analytical links (and evidence 
thereon) between domestic and foreign saving and alternative trade regimes. 
Bhagwati rightly concludes that the evidence from the country studies and 
other econometric exercises does not support the view that restrictionist 
exchange-control regimes “will, or are likely to contribute to increased 
domestic savings and/or to augmented capital formation. If anything, much 
of our evidence—at least on the domestic savings issue—suggests an opposite 
relationship” (p. 174). 

The project studies also provide documentation for the expected positive 
response of exports to a reduction in the bias imposed by an import- 
substituting QR regime; and moreover, as Bhagwati rightly emphasizes, “the 
project results indicate strongly that it is really a shift to successful liberaliza- 
tion and therefore continuing liberalization that is critical to improved export 
performance on a sustained basis” (p. 184). 
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The evidence cited and assessed on various other possible dynamic effects 
on entrepreneurship, innovation, and technical change and income distribu- 
tion is very inconclusive, though it does support the negative conclusion that 
there is little basis for the view that there are beneficial dynamic effects in 
these dimensions which outweigh the static efficiency costs of import- 
substituting QR regimes. 

Similarly, while “the cross-sectional contrasts among the Project countries 
indicate strongly that the countries that have shifted to [liberalized] Phases IV 
and V have better growth performance than those that have continued in 
Phase II-type restrictive regimes” (p. 191), the evidence for the reasons cited 
for why this may be the case again appears to be inconclusive. 

Bhagwati, like many other development economists, asymmetrically labels a 
movement from the neutral free-trade position as import substituting, while 
a movement back toward this point is labeled export promoting. In an age 
given to slogan mongering, I am not sure that this redefinition of a policy of 
promoting free trade (or “optimal” trade) is necessarily helpful, as it could 
lead people (who have not read Bhagwati with care) to believe that a policy of 
export subsidization above the level which restores the equality between the 
effective exchange rates on imports and exports was desirable. Like the 
Little-Scitovsky-Scott volume, Bhagwati’s message, though more muted, is 
that both in theory and practice a movement toward free trade is desirable in 
most developing countries, and it would seem best to say so clearly and 
emphatically. 

I was also troubled by Bhagwati’s hesitancy in arriving, as he puts it, “at 
strong prescriptive conclusions in regard to many of the detailed questions 
that are examined” (p. 5). In particular, on the dynamic effects of trade he 
states that, while most “general propositions [concerning the links between 
trade, savings, and growth] are not sustainable, individual cou ntry experiences 
may and often do indicate the dominance of only a subset of the possible 
linkages and effects, thus sustaining stronger propositions in regard to their 
own experience” (p. 5). But this raises the methodological question of the 
nature of the evidence required to substantiate the causal links in a chain of 
deductive economic argument linking various inductions. For, if one accepts 
the view that the dynamic case for free trade is that it is, in Irving Kravis’s 
(1970) phrase, “a handmaiden of growth,” it is not clear what other sort of 
general evidence would need to be adduced to support this view, apart from 
the historical and therefore counterfactual evidence presented for each 
country in the NBER sample. Surely on this basis we should be able to form a 
general prescriptive judgment in the dispute that arose during the 1950s 
between Nurkse and his followers on the one side, and Haberler and Viner on 
the other, on the desirability of free trade for postwar developing countries 
(see the quote in Bhagwati, p. 206). 

The Nurkseian skepticism about the desirability of free trade for contem- 
porary LDCs was based on three implicit or explicit assumptions. First was a 
pessimistic assumption about the external obstacles to LDC export growth, so 
that a shortage of foreign exchange was perceived to be chronic and an 
independent limiting factor in economic development. Second, economic 
agents in LDCs were assumed to be congenitally shortsighted in basing their 
investment and production decisions on current static rather than emerging 
dynamic comparative advantage. Finally, it was assumed that bureaucrats and 
planners could both predict the country’s dynamic comparative advantage 
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better than private agents and ensure that production and investment pat- 
terns conformed to it through detailed state intervention. The OECD and 
NBER studies for the various countries have documented the inadmissibility 
of all three sets of assumptions. The obstacles to LDC export growth were 
largely internal, not external; economic agents in these countries have reacted 
to the distorted set of incentives created by protectionist trade-control re- 
gimes much in the way that standard economic theory would predict; plan- 
ners have often shown a lack of foresight which would have swiftly bank- 
rupted any private agent. 

It is therefore surprising to find Bhagwati writing that “the statistical 
evidence and analysis of this possible cause and others of asymmetrical ad- 
vantage of the EP strategy is not yet available in anything like the degree that 
would be reasonably compelling” (p. 215). Given the largely simple 
econometric evidence that Bhagwati cites in his discussion of the dynamic 
effects of trade regimes, I suspect that this is the kind of “compelling” 
evidence he is seeking. However, it is doubtful whether it will ever be possible 
to find evidence for the truly dynamic effects of trade regimes from which the 
usual statistical inferences could be made, and for a very compelling reason: 
Economics, as Hicks (1979, p. 38) has put it, “is on the edge of science and of 
history.” The historical aspects are particularly important for what he terms 
“sequential causality,” which is the relevant notion of causality when we seek 
to understand the dynamic effects of trade regimes. But in studying such 
dynamic historical processes, the use of techniques of statistical inference may 
not be very useful, because “when we cannot accept that the observations 
along the time series available to us are independent or cannot by some device 
be divided into groups that can be treated as independent, we get into much 
deeper water. For we have then, in strict logic, no more than one observation, 
all of the separate items having to be taken together. We are left to use our 
judgment, making sense of what has happened as best we can, in the manner 
of the historian. Applied economics does then come back to history after all” 
(licks 19794 p:p1 12); 

The great strength of the NBER and OECD country studies is that they do 
provide analytical economic histories of particular LDC’s trade regimes. 
These can and should be used to provide judgments on the validity or 
otherwise of one or the other “view of the world,” but it is probably a vain 
hope to expect that it will ever be possible to derive “general laws” like those 
of physics for these dynamic historical processes by the use of statistical 
inference. May the inconclusiveness of Bhagwati’s discussion of the dynamic 
effects of alternative trade regimes perhaps be due to his setting himself an 
impossible standard of evidence for substantiating the view that free trade 
regimes are the “handmaidens of growth”? 

But perhaps this is being unfair to Bhagwati, who has written a lucid and 
scholarly book which, in his words, “the reader should approach . .. as a 
manual. Anyone who wishes to examine a specific country’s foreign trade 
regime and its consequences can read the present volume and readily have 
access to possible hypotheses and techniques of analysis . . . that may be used 
for the reader’s own analysis” (p. 5). In this he has succeeded and deserves 
commendation. 


DEEPAK LAL 


University College, London 
World Bank 
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Population and Economic Change in Developing Countries. Edited by RicHarp A. 
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This volume, number 30 of the Universities—National Bureau Committee for 
Economic Research conference reports, makes available papers and dis- 
cussants’ comments originally presented to the Conference on Population and 
Economic Change in Less Developed Countries held in Philadelphia in 1976. 
Nine distinguished authors have written major papers (a fact which must be 
stated when reviewing a conference volume, for, far too often, prominent 
researchers produce their second best for such occasions). Every paper is 
worth reading, and most are exciting. The volume will be necessary reading 
for people working in this and allied fields. 

Yet the book may, in the years to come, be thought of as a monument in the 
final years of a profoundly unsatisfactory period—the Aristotelian era—in the 
study of the economic demography of developing countries. There are two 
major problems in the volume, and both are essentially problems of the 
discipline, faithfully reflected in the conference. 

The first problem is that hardly any of the authors, and very few of the 
discussants, have ever spent any substantial period living and researching in a 
developing country. More limited experience is often just enough to convince 
the researcher, quite falsely, that he is in contact with a poorer version of our 
own society. It is as if anthropologists could still work without years of field 
experience, or as if geographers could write books from afar by merely 
interpreting local maps. 

This is not just carping criticism. The central theme of both the conference 
and the book, addressed by five of the nine papers, is Third World household 
economics and individual behavior and decision making. These papers are 
saturated with hesitant inferences, drawn from piles of anonymous data 
collected in a society which the authors really do not know, about how these 
households must work. Very rarely does one of the writers say with any 
assurance that this or that kind of situation is usually found in the country 
being discussed. The loss is immense and can be illustrated by some related 
examples. The value of children’s labor is compared with that of the adults in 
the family, or of outside paid labor, or that of slaves. No one points out that 
one reason for preferring children to adults is that they do various forms of 
menial work—“children’s work” —that adults hate doing. No one points out 
that, throughout much of Africa and Asia, only family members are trusted, 
and the presence of hired labor in the house or even the surrounding fields 
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gives rise to disquiet. No one shows that children’s labor is valuable only so 
long as they work hard with little resentment or protest, and that this is less a 
product of their existence than of a social system in which children expect to 
have their work directed by their elders and the older generation takes such 
direction for granted. It is a system which can be subtly transformed if 
cultural change (or the advent of schooling) alters the social relations between 
the generations. The market wage rate is frequently taken to measure the 
relative value (which must in a sense mean the intrinsic value and not be 
merely a synonym for market wage) of the labor of men, women, and 
children. Yet in a South Indian village in which my wife and I have lived and 
researched, although women are the preferred workers in certain postharvest 
operations because of the superior skills they are said to possess, their daily 
wage rate for that work is five-sevenths of that paid for the same work by men. 
Children’s wages are lower still, even for tasks where they are demonstrably 
just as efficient as their elders. Wage levels do not measure work inputs, but 
differentials by sex and age are maintained in order to preserve a certain 
social fabric. 

The second problem—related in the sense that it also arises from sub- 
stituting pontification of a type for which we have condemned medieval 
schoolmen for long experience of the societies being analyzed—is the ten- 
dency to treat each society as autonomous. To a considerable extent this was 
the case in premodern Europe in that external cultures had no major impact, 
and hence Ronald Lee’s perspective of preindustrial England escapes this 
censure. Yet it is not true in today’s developing world. When we have asked 
people in both Africa and Asia about the major distinctions between their 
ways of life and those of their parents or more distant ancestors, they have far 
more often quoted aspects of social change than of economic development. 
Furthermore, they overwhelmingly explain such social change—and almost 
certainly correctly—in terms of imported Westernization (or social moderni- 
zation) or because of learning other ways of life in school (seen quite clearly as 
an imported institution), rather than as the minimum alteration needed for 
accommodation to economic change. The social influence of the West, 
whether approved or disapproved, is felt to be immense. The decline of the 
patriarchy with all its economic advantages to older males is seen to have come 
about because of the new ideas. For most of the developing world the real fact 
of life is not a gradual development from the preexisting economic and social 
base but an accelerating revolution that has come about because of the 
extraordinary movement in this century toward a global economy (frequently 
noted in contexts other than this volume) and a global society (much less 
frequently noted). Yet almost every analysis in the book implies an almost 
autonomous economic and social change in every society, even tiny national 
states. That this is not really so can be inferred from some of the findings; for 
instance, Samuel Preston’s proof that perhaps only one-third of the mortality 
decline this century in developing countries can be explained by rising income 
levels, while the other two-thirds (although it is not put this way) arises from 
the ever stronger interrelations between industrial and nonindustrial societies 
and indeed among all societies. This blind spot occurs not only because of 
inexperience of the Third World, but because the researcher becomes a 
prisoner of his own methods of analysis. The data for each country or society 
can easily be interrelated, but they cannot easily be made to divulge the extent 
to which change in their magnitudes is a result of external forces. In addition, 
the treatment of each society as autonomous allows apparently meaningful 
comparisons over distance and time. 
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There is a feeling, at least in demographic circles, that the era when 
approaches of this type have been possible is drawing to a close. The Interna- 
tional Review Group of Social Science Research on Population and Develop- 
ment (IRG) Report, a forthcoming report of the International Union for the 
Scientific Study of Population (IUSSP), and other individuals and organiza- 
tions are all urging population researchers toward microstudies, an- 
thropological demography, and saturation in specific cultures. We are in the 
situation anthropology found itself in 60 years ago when Malinowski in 
Britain and Boas in the United States were trenchantly stating the necessity 
for knowing the real world. Economic theory has not crumbled through lack 
of such experience when it deals with the industrialized world only because 
most of its practitioners have known from a lifetime’s experience the basic 
nature of that world. First World specialists in the population and economy of 
the Third World have not found themselves in such a fortunate position. 

The first chapter takes the form of an excellent review and analysis by Peter 
Lindert of child costs from both contemporary and historical data. Yet today’s 
Third World seems almost as far off as eighteenth-century England, and 
inferences about the Third World seem about as direct as today’s inferences 
about eighteenth-century England. 

The heart of the debate is provided by chapter 4, an attempt by Paul 
Schultz to apply the theories of the Chicago household economists to 
Taiwanese data, and chapter 5, a restatement by Richard Easterlin, Robert 
Pollak, and Michael Wachter of the University of Pennsylvania’s tastes and 
preferences approach, this time written as a specific reaction to the household 
economists. 

The application of the Chicago school’s theories to developing countries 
has always been disconcerting because of its implication that the options for 
the use of time open to the middle-class American woman bear some relation 
to the situation of village women in the Third World. We have never been 
able to detect much difference in the allocation of time in the latter group 
between those who went to school and those who did not (ignoring the tiny 
fraction with higher education). The reason seems to be a cultural enforce- 
ment of the tasks which women must do and a sharing of child care between 
different members of the family and other relatives regardless of the mothers’ 
level of education, plus a fairly rough-and-ready raising of children that often 
means little time spent exclusively on each child and little emotional drain. 
The differences that can be detected seem to run in the direction opposite 
that required by the Chicago school: A more educated mother can sometimes 
exert enough influence to increase the amount of time she spends with her 
children (and to insist that they are more exclusively her children) at the 
expense of more onerous work in the fields or in food processing. The 
application of the theories to Taiwan is particularly disconcerting for two 
reasons. The first is the now customary use of past education for current 
value of time in a society where education probably has far greater direct 
impact on age at marriage and use of contraception (as is shown by Ronald 
Freedman in his comments) and a profound impact on family relations and 
hence the ability to profit from family members’ labor. It is high time that the 
Chicago school gave itself a basis for working in the Third World by en- 
couraging studies at the household level of real time options and the making 
of choices. Such studies of 20 households would be as worthwhile as the 
comparison of thousands with macroscopic data employing probably wholly 
inappropriate proxies. The second, one readily conceded by Schultz, is the 
fact that comparisons could not be made with population units smaller than 
provinces (which, as Freedman pointed out, tend to be either city or noncity). 
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The Pennsylvania paper makes the point, which is universal knowledge in 
the Third World but is often ignored in the economic models of the First 
World so as to make them apparently universal, that before the onset of 
fertility transition there is very rarely any worry at all about the level of 
marital fertility (except when it is too low) and no greater attempts are made 
to restrict fertility at higher than at lower parity levels (a situation known 
somewhat unhappily as “natural fertility”). The age at marriage or the dura- 
tion of sexual abstinence has nothing to do with intended ultimate family 
size. Indeed, decision making about family size does not exist and therefore 
cannot be studied until the onset of fertility transition—a forbidding situation 
for economic demographers because it rules out the universality of most of 
their models and the ability of these models to explain not only the conditions 
of stable high fertility but also the situation immediately preceding (and 
crucial in determining the timing of) the beginning of transition. The 
Pennsylvania trio makes the point that tastes and preferences must exist, but 
it seems to regard them as having some kind of life of their own rather than 
having underlying determinants (this is almost possible if you can convince 
yourself that the wobbles in the Western fertility decline that produced the 
so-called baby boom of the 1950s, and the falling fertility of the 1970s, are 
important phenomena in the transformation of stable high-fertility societies 
into low-fertility ones). They make many good points in producing a model so 
general that it borders on being a philosophic discussion rather than a tight 
description of the conditions of change (and, as Warren Sanderson charges, 
“the model is consistent with almost any kind of behavior”). 

Allen Kelley, employing a fraction of other people’s data collected for other 
purposes in urban Kenya, comes to some fascinating conclusions: The 
number of children does not influence the rate of household financial saving; 
“the income effects of an increase in education may in some instances out- 
weigh the price-substitution effects, resulting in an increase in family size with 
increasing education levels”; and (stating the obvious, although it is said to be 
a radical departure from the conventional wisdom) “contrary to the usual 
model where children are considered only as consumer durables, there is 
some evidence that children may also serve as producer durables in the 
low-income setting” (p. 436). The basic weakness of the paper is that the 
determination of savings by subtracting vastly greater expenditure figures 
from income figures, in a situation where neither is reliable, is wholly uncon- 
vincing. 

Yoram Ben-Porath, in a paper which can be related only loosely to the 
major theme of the book, economic development and social change in a 
continuing society, examines the impact of the migration of Jews from Africa 
and Asia on their fertility and their replacement of dead children. His 
methods are ingenious, and he appears to show that “the income effects [in 
raising fertility] may precede in time the adjustment in life-style and aspira- 
tions that generates the fertility-depressing substitution effects” (p. 181). One 
problem is that more than life-style and aspirations changed: Economic 
realities both outside and inside the family did (the youth, who would work 
uncomplainingly in the family business in Yemen, wanted to seek an outside 
job in Israel and to control the spending of his earnings), as did the mortality 
of children. The other is the fact that the separation of demographic and 
other events between the pre- and postmigration periods is so difficult that 
one is not quite convinced that it has been wholly successfully achieved. 

The volume does contain three fascinating macroscopic papers. One by 
Simon Kuznets argues (in a way that is necessary but should not be) that the 
mortality decline in developing countries over the last 60 years has not only 
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been a major contribution to real living standards but has disproportionately 
assisted the least privileged and thus has reduced social inequality. He argues 
that mortality reduction is an indispensable prerequisite for fertility decline 
and that we do not yet know the social impact of fertility decline in the Third 
World. He also shows that it is probable that the mortality declines in de- 
veloping countries were as steep between the two world wars as since World 
War II. That is, they began when colonial administration was strong enough 
and sufficiently responsible to keep law and order and to apply the available 
health technology (as well as beginning the spread of schooling). It is a myth 
that the LDCs had to wait for antibiotics and WHO programs. Indeed, there 
is support for the widespread belief in the Third World and the historic West 
that it was strong regimes that benefited populations and that such benefit 
came in the form of prosperity and was evidenced by population growth 
arising from mortality decline. 

There is evidence also (although it is not spelled out this way) in an 
important paper by Ronald Lee which extends his previous work. Mortality in 
England apparently declined consistently through the long years of the 
Tudor peace. The central theme of the analysis, however, is the apparent 
demonstration that population growth in England was for 600 years, from 
1200 to 1800, controlled almost solely by changes in mortality levels (not 
themselves determined either economically or by variations in fertility) and 
that population growth so determined real wages that during the whole 
period a 1 percent movement in population gave rise to a 1.5 percent move- 
ment of wages in the opposite direction. The note of caution is sounded 
because there is less certainty about mortality trends for the whole population 
than for the upper classes and because the argument is most powerfully 
buttressed by swings in population between 1250 and 1450 which are far 
from certain. 

Preston’s paper, referred to earlier, also analyzes the mortality decline in 
terms of changing patterns in the cause of death and bravely attempts to work 
out a range of consequences of mortality decline. 

Finally, the inclusion of Michael Todaro’s paper on internal migration, 
clearly setting out his model and presenting a bibliographical guide to other 
studies, does something to justify the broad title of the book, though it 
appears somewhat anomalous in terms of the main thrust of the work. 

The NBER volume presents, usually at one remove from the collection of 
data, much interesting material and a good deal of sophisticated analysis. 


Joun C. CALDWELL 
Austrahan National University 


The Regulated Industries and the Economy. By Paut W. MacAvoy. 
New York: W. W. Norton & Co., 1979. Pp. 160. $11.95. 


The regulatory economics field has grown rapidly since the publication of 
Alfred E. Kahn’s (1970) seminal text 11 years ago. Powerful new institutions 
have been created to regulate health, safety, and environmental problems, 
several of the older industry-management agencies have relaxed their regu- 
latory grip in response to active or threatened legislative deregulation, and 
the economics profession’s understanding of the effects of regulation on the 
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economy has improved. Paul W. MacAvoy’s latest book, The Regulated Indus- 
tries and the Economy, evaluates both the success (or, more accurately, the 
failure) of the new social regulations and the increasing costs imposed by 
industry-management agencies in this recent inflationary period, concluding 
that the quality and the quantity of goods and services produced in the 
regulated industries have declined due to regulation. 

The book aims at a general audience interested in problems of regulation 
but not necessarily well versed in economics. Its lack of coverage of many 
important topics in regulatory economics makes it unsuitable for a main text 
in the area. As a research monograph, the MacAvoy book is too loosely 
constructed to support satisfactorily many of its hypotheses. I found the data 
presented to suggest strongly some fascinating hypotheses, but often compet- 
ing hypotheses were ignored, sufficiently refined data-analysis techniques 
were not used, and some broad, only weakly supported hypotheses were 
presented. 

The book focuses on three main topics: the effects of price and entry 
regulation on the economy; the effects of health, safety, and environmental 
(social) regulation on the economy; and reform proposals. I found the evalu- 
ation of the impacts of price and entry regulation to be the most valuable 
section because its central hypothesis is important, as well as little recognized, 
and because that hypothesis is convincingly supported. The main finding 
states that due to their backward-looking, cost-based nature, the regulatory 
lag and money illusion that exist under price and entry regulation keep price 
increases behind and below cost increases, which causes a severe problem in 
high-inflation periods such as the 1970s (and, more important, the 1980s). 
This price squeeze pushes profit rates below average returns, thereby slowing 
investment and eventually leading to production shortages and reduced 
quality of service. MacAvoy illustrates his thesis with a careful analysis of the 
natural gas shortages that were caused by FPC price regulation, and he then 
persuasively argues that the electric power, railroad, and telephone industries 
may be headed along the same path. The seriousness of this result places 
reform of the regulatory process for these industries high on the political 
priority list. 

Although reasonable evidence is presented to support most of the conclu- 
sions, many are overstated, without necessary qualifications, and without 
exploration of reasonable alternative hypotheses. As only one example, 
MacAvoy cites rate relief less than that requested by electric companies as 
evidence that regulation had some price-reducing effect, when the discrep- 
ancy between the requested and the granted rate increases can also be 
explained by gaming between both parties. 

In contrast to his analysis of price and entry regulation, MacAvoy’s analysis 
of the effects of social regulation is too simplistic. He studies three agencies, 
EPA, NHTSA, and OSHA, finds each of them to impose high costs and no 
significant benefits, and then concludes that social regulation is generally 
counterproductive. é 

The consistency of the conclusion may be satisfying to some, but it simply 
cannot be supported. The literature does suggest that OSHA and NHTSA 
have provided few benefits. However, the conclusion of MacAvoy’s own 
study, that the EPA air programs have yielded no air-quality benefits, is 
difficult to accept, both because of questions about his methodology and 
because of its inconsistency with other studies. Even if OSHA, NHTSA, and 
the EPA air program did provide no benefits, it would not follow that other 
examples of social regulation provide benefits less than their costs or, worse 
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yet, provide no benefits at all. Although imposing severe costs on the economy 
in terms of forgone benefits from consumer products, the CPSC has also 
provided some benefits, if only from the added safety features it has induced 
manufacturers to adopt “voluntarily” to avoid CPSC action. And few knowl- 
edgeable about EPA water programs would argue that those programs have 
provided no significant improvements in water quality relative to what water 
quality would have been without the 1972 Water Pollution Control Act 
Amendments. 

Regulatory reform comprises MacAvoy’s third focus. He suggests reforms 
such as automatic rate increases (subject to later repayment if found to be 
excessive), the use of future test years, a requirement of economic impact 
statements for all significant regulations, and deregulation where competi- 
tion, service quality, and the economy dictate this solution. His analysis is 
quite brief. I would like to see a book which thoroughly analyzes each of these 
and other reform proposals, compares them to existing procedures, and 
evaluates all their consequences to both industry and consumers. 

Other than the problem of structuring a transition to competition in cur- 
rently regulated “competitive” industries that will gain as many of the benefits 
of deregulation as possible, the most difficult regulatory problems in the next 
decade will consist of the reform of social regulation. For example, how 
should we implement Charles Schultze’s (1977) proposal to replace 
command-and-control regulations with price incentives, such as marketable 
permits or effluent fees for environmental pollution? In rule making, how 
can we speed up the process while allowing open access to all interested 
parties and while guaranteeing that high-quality, unbiased scientific and 
economic analyses are used to support regulatory decisions? Given the ram- 
pant growth of regulation in the 1970s, can mechanisms like the existing 
Regulatory Council, the proposed regulatory budget, or the suggested legis- 
lative veto be expected to effectively coordinate many diverse and costly 
regulatory activities? These problems illustrate the need for careful thinking 
by economists about a variety of regulatory reforms that public policy must 
soon address. 


WESLEY A. MAGAT 
Duke University 
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The Rational Peasant: The Political Economy of Rural Society in Vietnam. By 
SAMUEL L. PopkKIN. 

Berkeley: University of California Press, 1979. Pp. 306+xxi. $14.95 (cloth); 
$4.95 (paper). 


The Origins of English Individualism: The Family, Property and Social Transition. 
By ALAN MACFARLANE. 

New York: Cambridge University Press, 1979. Pp. 216+xv. $19.95 (cloth); 
$6.95 (paper). 


Perhaps there is yet an economist who thinks peasants are irrational and 
collectivist. It is difficult to believe that such a benighted view of the coun- 
tryside could persist, but this is the age of wonders. In truth, it remains a 
commonplace of development planning and scientific analysis. The two books 
under review here are signs of a counterattack. 

Clear and important arguments can be described by their rhetorical dual as 
well as by their primal, by what they attack as well as by what they affirm. So 
here. As his title suggests, Samuel Popkin, a young political scientist at the 
University of California at San Diego, is attacking the World Bank view that 
peasants are idiots and the more sophisticated view, articulated recently by 
James C. Scott in The Moral Economy of the Peasant (1976), that peasants are 
rational but rationally communitarian. Popkin believes they are not com- 
munitarian in general and were not in particular over the past century in his 
country of study, Vietnam. He believes that the economic rationality of 
Vietnamese villagers in choosing crops was at one with their political rational- 
ity in choosing allies, both in the service of individual well-being. 

The evidence for his beliefs is persuasive. The first third of the book is a 
long essay on “The Peasant,” not necessarily Vietnamese, marshaling the 
latest work by medievalists and anthropologists (except Macfarlane’s, pub- 
lished too late) against the views of Karl Polanyi, George Dalton, Eric 
Hobsbawm, Eric Wolf, James Scott, Joel Migdal, and Clifford Geertz. The 
“moral economists” “assume that peasants are antimarket, [that they] prefer 
common property to private, . . . [and] that peasant welfare depends on the 
closed corporate villages... . The fabric of the moral community is rent. . . by 
the intrusion of the ‘cash nexus’” (pp. 5—6). The “political economy” view, as 
Popkin calls it, notes that the evidence for the moral economy view is weak. 
Peasants from Tepotzlan to Vilyatpur and from the 1280s to the 1970s, 
among other evidences of selfish and clearheaded maximizing, did not help 
each other to pay taxes (p. 41), demanded strict reciprocity in charity (p. 47), 
used gleaning rules to prevent misallocation of labor (not to help the poor [p. 
52]), expected to grow rich out of village offices (p. 60), resisted innovation 
when it was rational to do so (p. 66), and engaged in exchanges of land or 
crops without social disaster (p. 68). The opening chapters constitute a bril- 
lant survey of the context within which economists measure the supply re- 
sponse of African farmers and the transformation of traditional agriculture. 
A central planner who can read Popkin without some twinge of doubt that his 
bureaucracies, communes, and other substitutes for voluntary exchange are 
good for the poor (p. 29) is a hard man to persuade. 

The rest of the book may do the trick. It is a narrative of Vietnamese 
political history down to 1946 emphasizing the impact of colonialism on the 
politics of villages. The narrative, like the analytical essay, is skillfully done, 
although of course it is difficult in any case for an American over 25 to yawn 
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at the names Hue, Tonkin, Cho Lon, and the like. The writing is modest, 
clear, and warm. So far as can be judged from his surefootedness in the 
literature on the European peasantry, Popkin’s opinions in fields outside my 
competence are to be taken seriously. The index testifies to his heavy reliance 
on Blum and Hilton for Europe, and on Gourou, Gram, Ory, Osborne, 
Rambo, and Woodside for Vietnam. The book is generous to its enemies and 
almost embarrassing in its praise for allies. The footnotes are filled with 
references to studies that are “wonderful,” “excellent,” “must reading.” The 
burden of bringing a chaotic history to bear on wider issues is sometimes too 
great, leading to excessive repetition and summarization. And the narrative 
portion can be faulted as rather too broadly focused to make the case about 
the details of village politics the argument requires. But the book is loaded 
with good observing and good thinking. 

The major thought inspired by Popkin’s history of Vietnam under the 
French is that the village was no passive victim of colonization and commer- 
cialization. On the contrary, village notables themselves called the new world 
into existence to redress the balance in the old. The new colonial taxes, for 
instance, were occasions for the notables to consolidate their power over the 
rest of the villagers (p. 149). Before the French came, “villages were not 
egalitarian, leveling, welfare oriented, or necessarily harmonious” (p. 122). In 
consequence, the many antifeudal movements of the twentieth century, 
whether religious or communist, did not look back to a Confucian golden age: 
the Vietnamese themselves knew well the rapacity of rural life. It is the 
analysts of the peasant wars of the twentieth century, not the warriors them- 
selves, who idealize the communities the peasants came from. 

Popkin criticizes too the other half of the idealized portrait of the death of 
community, namely, the vile and domineering colonialists who murdered it. 
The flaw, Popkin argues, is to attribute effectiveness in evil to the French 
administration. The French supported 10 times as many bureaucrats per 
head of local population as did the British in India yet failed repeatedly in 
their schemes to manipulate the villagers. The story of colonialism is not one 
of effective and evil government, which needs only to be replaced by effective 
and good government to reestablish the communitarian values of the golden 
age. Itis a story of ineffective government lost in a sea of selfish peasants. The 
story should have a certain familiarity to Americans lately embroiled in 
Southeast Asia. 

Popkin recognizes that the source of the mythology of the unselfish peasant 
is history, especially English history. So does Macfarlane, whose book is a 
more direct challenge to the mythology: “a great deal of modern India, 
Africa, or South America cannot be understood without some comprehension 
of what happened on a small island with only a few million inhabitants between 
the thirteenth and eighteenth centuries” (p. 7). The dual of Macfarlane’s 
interpretation of English history is the one traditional since Marx, that Eng- 
land was like other economies long ago in having many “peasants.” The 
peasant is here not merely a countryman but the denizen of a society with 
production in the home, no markets in products, no markets in factors of 
production, no geographical mobility, great emphasis on kinship, inheritance 
by the family instead of the individual, ownership by the family instead of the 
individual, an all-pervasive family life in which women marry young and are 
subordinate to their husbands, and sharp lines between this peasantry so 
described and the nobility and its state. In brief, the peasant is the man we all 
know and love from the works of Marx, Weber, Tawney, and their recent 
(collective) heirs—the same list that Popkin called the “moral economists.” 
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The history that all educated people carry about in their heads is “a progres- 
sion from small, isolated communities inhabited by ‘peasants’ . . . towards the 
market, monetized, ‘open’ social structure of the eighteenth century” (p. 54). 
The history, says Macfarlane, is wrong. England was in fact a nonpeasant 
society from earliest times for which we have records: there was no revolution 
of the sixteenth century. Like Popkin, he buttresses his own primary evidence 
(from two English villages that he himself is studying intensively) with a 
brilliant survey of work by others bearing on the point. No medievalist will 
find the facts reviewed novel, though he may be irritated by the single- 
mindedness with which they are deployed. Indeed, anyone who finds Mac- 
farlane’s (or Popkin’s) theme entirely surprising has been relying for his view 
of English medieval society either on the works of Maitland or Vinogradoff, 
brilliant in their day (ca. 1900) but still in the grip of nineteenth-century 
German scholarship and its romantic view of the brotherhood of times past, 
or on textbooks by political historians who rely in turn on Maitland and 
Vinogradoff. 

As you and I would turn to markets, and the political scientist Popkin turns 
to politics and power, it is natural that Macfarlane as an anthropologist turns 
to family relations to test the antiquity of English individualism. He argues 
that the English family was at an early stage nuclear, formed rather late in life, 
oriented toward the individual, respectful of women’s rights, and un- 
confining. In particular, the individual, even women, inherited the family’s 
land and could dispose of it at will. The central evidence is simply the records 
showing individuals disposing of land, in large amounts and with great 
freedom—as they did labor and produce. The notion that the early medieval 
(not to speak of late medieval) economy was a “natural economy,” as the 
Germans used to put it, is flatly mistaken. Sic transit all manner of hypothetical 
histories, from Marx’s notion of precapitalist economic formations, necessar- 
ily unsubstantiated in an age before the professionalization of history, down 
to such newer abominations, which may not be so easily pardoned, as 
Polanyi’s The Great Transformation (1944), Hicks’s A Theory of Economic History 
(1969), and Wallerstein’s The Modern World System (1974). 

Like Popkin, again, Macfarlane does not prove his case beyond all doubt. 
He is as uncomfortable with quantitative arguments as were the medieval men 
he studies and tries repeatedly to force the evidence to say yes or no when it 
can only say more or less. The most troublesome feature of the book is its 
insistence throughout that individualism was a peculiarity of England. What 
might be interpreted as a device of rhetoric to silence the cries of “it’s- 
different-in-Dijon-and-Dacca” becomes in the end a disfigurement. Always it 
is “in England” or “at least in relation to England” that one after another myth 
of peasant life debunked, until finally (p. 199) we are told that by Adam 
Smith’s time “England had probably long been different from almost every 
other major agrarian civilization.” The evidence for this key assertion has the 
same weakness that Macfarlane detects throughout in others, as in the work 
of George Homans: “There appears to be a strong, self-confirming, and 
circular hypothesis. It is thought to be self-evident that the rural inhabitants 
of England [for present purposes read ‘of any particular community’] were 
basically like ‘peasants’ elsewhere. . . . It is [then] justifiable to fill in the vast 
gaps .. . by drawing information from peasants elsewhere. The picture of [the 
society] which emerges then seems to show that there were really peasants” (p. 
191): 

For all his iconoclasm in the English case, then, Macfarlane retains a 
respectful attitude toward the notion that peasants elsewhere are a very odd 
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sort. The antidote is of course Popkin—Vietnam is most assuredly 
“elsewhere.” From the union of these two books little remains of the myth of 
the peasant, historical or modern. Popkin’s brief survey of the medieval 
evidence should be strengthened by Macfarlane; and Macfarlane’s reluctance 
to extend his evidence outside England should be overcome by Popkin. 
The books are not economics, though they talk much of economic matters. 
The question arises whether an economist should break off the scrutiny of his 
out-garbage from the comp center long enough to peruse a book on historical 
political science and a book on historical anthropology. Yes, he should. His- 
torical mythologies live on in the premises of the economist who believes 
himself, whether proudly or humbly, to be a mere technician. To learn that 
“neither decline nor decay of peasant institutions is necessary for peasants to 
enter markets” (Popkin, p. 267) or that “‘homo economicus’ and market 
society [were] present in England for centuries” gives the economist new 
fields to conquer, and the conquered fields may in turn civilize his economics. 


DonaLp N. McCLoskey 
University of Iowa 
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Edward P. Lazear and Sherwin Rosen 


University of Chicago and National Bureau of Economic Research 


This paper analyzes compensation schemes which pay according to 
an individual's ordinal rank in an organization rather than his out- 
put level. When workers are risk neutral, it is shown that wages based 
upon rank induce the same efficient allocation of resources as an 
incentive reward scheme based on individual output levels. Under 
some circumstances, risk-averse workers actually prefer to be paid 
on the basis of rank. In addition, if workers are heterogeneous in 
ability, low-quality workers attempt to contaminate high-quality 
firms, resulting in adverse selection. However, if ability is known in 
advance, a competitive handicapping structure exists which allows all 
workers to compete efficiently in the same organization. 


I. Introduction 


It is a familiar proposition that under competitive conditions workers 
are paid the value of their marginal products. In this paper we show 
that competitive lotteries are often efficient and sometimes superior 
to more familiar compensation schemes. For example, the large 
salaries of executives may provide incentives for all individuals in the 
firm who, with hard labor, may win one of the coveted top positions. 

This paper addresses the relation between compensation and in- 
centives in the presence of costly monitoring of workers’ efforts and 


We are indebted to Jerry Green, Merton Miller, James Mirrlees, George Stigler, 
Joseph Stiglitz, and Earl Thompson for helpful comments. The research was supported 
in part by the National Science Foundation. The research reported here is part of the 
NBER’s research program in Labor Studies. Any opinions expressed are those of the 
authors and not those of the National Bureau of Economic Research. 
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output. A wide variety of incentive payment schemes are used in 
practice. Simple piece rates, which have been extensively analyzed 
(see, e.g., Cheung 1969; Stiglitz 1975; Mirrlees 1976), gear payment 
to output. We consider a rank-order payment scheme which has 
not been analyzed but which seems to be prevalent in many labor 
contracts. This scheme pays prizes to the winners and losers of labor 
market contests. The main difference between prizes and other in- 
centive schemes is that in a contest earnings depend on the rank order 
of contestants and not on “distance.” That is, salaries are not contin- 
gent upon the output level of a particular game, because prizes are 
fixed in advance. Performance incentives are set by attempts to win 
the contest. We argue that in many circumstances it is optimal to set 
up executive compensation along these lines and that certain puzzling 
features of that market are easily explained in these terms. 

Central to this discussion are the conditions under which mecha- 
nisms exist for monitoring productivity (Alchian and Demsetz 1972). 
If inexpensive and reliable monitors of effort are available, then the 
best compensation scheme is a periodic wage based on input. How- 
ever, when monitoring is difficult, so that workers can alter their 
input with less than perfect detection, input-wage schemes invite 
shirking. The situation often can be improved if compensation is 
related to a more easily measured output level. In general, input- 
based pay is preferable because it changes the risk borne by workers 
in a favorable way. But when monitoring costs are so high that moral 
hazard is a serious problem, the gain in efficiency from using output- 
based pay may outweigh the risk-sharing losses. Paying workers on 
the basis of rank order alters costs of measurement as well as the 
nature of the risk borne by workers. It is for these reasons that it is 
sometimes a superior way to bring about an efficient incentive struc- 
ture. 

In the development below we start with the simplest case of risk 
neutrality to illustrate the basic issues. Then the more general case of 
risk aversion is treated in Section III. Section IV considers issues of 
sorting and self-selection when workers are heterogeneous. 


II. Piece Rates and Tournaments with Risk Neutrality 


To keep things simple and to avoid sequential and dynamic aspects of 
the problem, we confine attention to a single period in all that follows. 
Therefore, the reader should think of the incentive problem in terms 
of career development and lifetime productivity of workers. The 
worker’s (lifetime) output is a random variable whose distribution is 
controlled by the worker himself. In particular, the worker is allowed 
to control the mean of the distribution by investing in costly skills 
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prior to entering the market. However, a given productivity realiza- 
tion also depends on a random factor which is beyond anyone’s 
control. Employers may observe output but cannot ascertain the ex- 
tent to which it is due to investment expenditure or to good fortune 
or to both, though workers know their input as well as output. Worker 
j produces lifetime output q; according to 


qi = Mi + Es (1) 
where py; is the level of investment, a measure of skill or average 
output, chosen by the worker when young and prior to a realization of 
the random or luck component, e;. Average skill, 1;, is produced at 
cost C(), with C’, C” > 0. The random variable €; is drawn out of a 
known distribution with zero mean and variance o?.! Here e¢ is lifetime 
luck such as life-persistent person-effects or an ability factor, which is 
revealed very slowly over the worker’s lifetime. The crucial assump- 
tion is that productivity risk is nondiversifiable by the worker himself. 
That is another reason for choosing a long period for the analysis. For 
example, if the period were very short and the random factor was 
independently distributed across periods, the worker could diversify 
per period risk by repetition and a savings account to balance off 
good and bad years. Evidently a persistent person or ability effect 
cannot be so diversified when it is undiscoverable quickly, as appears 
true of managerial talent, for example. It is assumed, however, that € 
is i.i.d. across individuals, so that owners of firms can diversify risk 
either by pooling workers together in one firm or by holding a 
portfolio. 

To concentrate on incentive aspects of various contractual ar- 
rangements, we adopt the simplest technology for firms. Production 
requires only labor and is additively separable across workers. By 
virtue of the independence assumptions, managers act as expected 
value maximizers or as if they were risk neutral. Free entry and a 
competitive output market set the value of the product at V per unit. 
Again, these assumptions are adopted to illustrate basic issues in the 
simplest way. The analysis also applies when there are complemen- 
tarities among workers in production, which is more realistic but more 
difficult to exposit. 


Piece Rates 


The piece rate is very simple to analyze when workers are risk neutral. 
It involves paying the worker the value of his product. Let r be the 


1 In this paper the worker has no choice over o. This does not affect the risk-neutral 
solution but does have an effect if workers are risk averse, since they tend to favor 
overly cautious strategies. Also, virtually all the results of this paper hold true if the 
error structure is multiplicative rather than additive. 
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piece rate. Ignoring discounting, the worker’s net income is rg — C(y2). 
Risk-neutral workers choose to maximize expected net return 


E[rq — C(u)] = rp — C(p). 


The necessary condition is r = C’() or the familiar requirement that 
investment equates marginal cost and return. On the other hand, the 
expected profit of a firm is 


E(Vq-—r1rq)=V —1r)p, 
so free entry and competition for workers imply r = V. Consequently 
V =C'(p). 


The marginal cost of investment equals its social return, yielding the 
standard result that piece rates are efficient. 


Rank-Order Tournaments 


We shall consider two-player tournaments in which the rules of the 
game specify a fixed prize W, to the winner and a fixed prize W, to the 
loser. All essential aspects of the problem readily generalize to any 
number of contestants. A worker’s production follows (1), and the 
winner of the contest is determined by the largest drawing of g. The 
contest is rank order because the margin of winning does not affect 
earnings. Contestants precommit their investments early in life, 
knowing the prizes and the rules of the game, but do not communi- 
cate with each other or collude. Notice that even though there are two 
players in a given match the market is competitive and not oligopolis- 
tic, because investment is precommitted and a given player does not 
know who his opponent will be at the time all decisions are made. 
Each person plays against the “field.” 

We seek to determine the competitive prize structure (W,,W,). The 
method proceeds in two steps. First, the prizes W, and W, are fixed 
arbitrarily and workers’ investment strategies are analyzed. Given 
these strategies, we then find the pair (W,,W,) that maximizes a 
worker’s expected utility, subject to a zero-profit constraint by firms. 
It will be seen that a worker’s incentives to invest increase with the 
spread between winning and losing prizes, W, — W,. Each wants to 
improve the probability of winning because the return to winning 
varies with the spread. The firm would always like to increase the 
spread, ceteris paribus, to induce greater investment and higher 
productivity, because its output and revenue are increased. But as 
contestants invest more, their costs also rise. That is what limits the 
spread in equilibrium: Firms offering too large a spread induce exces- 
sive investment. A competing firm can attract all of these workers by 
decreasing the spread because investment costs fall by more than 
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expected product, raising expected net earnings. Increasing mar- 
ginal cost of skill implies a unique equilibrium spread between the 
prizes that maximizes expected utility. 

More precisely, consider the contestant’s problem, assuming that 
both have the same costs of investment C(y), so that their behavior is 
identical. A contestant’s expected utility (wealth) is 


Ci CU Wo CG) — EW (1 — F Ws Ce), 
(2) 
where P is the probability of winning. The probability that 7 wins is 


P = prob (4; > qx) = prob (M; — Me > € — €5) (3) 
= prob (uw; — wx > €) = G(e; — px), 
where & = e, — €;,€ ~ g(€), G(-) is the cdf of €, E(é) = 0, and E(é) = 
20? (because e; and e, are i.i.d.). Each player chooses py; to maximize 
(2). Assuming interior solutions, this implies 





oP ; 
(W, ~ W,) 2 — ~ C'(ui) = 0 
Ki 
and 
o?P 7 Hout 
Vas Nar aie (ji) < 0,7 =7, k. (4) 


We adopt the Nash-Cournot assumptions that each player optimizes 
against the optimum investment of his opponent, since he plays 
against the market over which he has no influence. Therefore, j takes 
[, as given in determining his investment and conversely for k. It then 
follows from (3) that, for player j 


OP Op; = OG(w; — Me)/OH;5 = B(Mi — Mx), 


which upon substitution into (4) yields j’s reaction function 
(W, — W2)g(ms — be) — C'(u;) = 9. (5) 


Player k’s reaction function is symmetrical with (5). 

Symmetry implies that when the Nash solution exists, 4; = 4, and P 
= G(0) = %, so the outcome is purely random in equilibrium. Ex ante, 
each player affects his probability of winning by investing.’ 


2 However, it is not necessarily true that there is a solution because with arbitrary 
density functions the objective function may not be concave in the relevant range. It is 
possible to show that a pure strategy solution exists provided that o” is sufficiently large: 
Contests are feasible only when chance is a significant factor. This result accords with 
intuition and is in the spirit of the old saying that a (sufficient) difference of opinion is 
necessary for a horse race. Stated otherwise, since OP/Op; = g(4; — Mx) and g(-) isa pdf, 
&P/du3 = g'(; — Mx) may be positive, and fulfillment of second-order conditions in (4) 
implies sharp breaks in the reaction function. If 0” is small enough the breaks occur at 
very low levels of investment, and a Nash equilibrium in pure strategies will not exist. 
Existence of an equilibrium is assumed in all that follows. 
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Substituting w; = m, at the Nash equilibrium, equation (5) reduces 
to 


C'(mi) = (W, — W2)g(0), 2 =7,k, (6) 


verifying the point above that players’ investments depend on the 
spread between winning and losing prizes. Levels of the prizes only 
influence the decision to enter the game, which requires nonnegativ- 
ity of expected wealth. 

The risk-neutral firm’s realized gross receipts are (q; + q,) ‘ V, and 
its costs are the total prize money offered, W, + W,. Competition for 
labor bids up the purse to the point where expected total receipts 
equal costs W, + W, = (4; + px) ° V. But since w; = wy, = mw in 
equilibrium, the zero-profit condition reduces to 


Vp = (W, + W,)/2. (7) 


The expected value of product equals the expected prize in equilib- 
rium. Substitute (7) into the worker’s utility function (2). Noting that 
P = ¥ in equilibrium, the worker’s expected utility at the optimum 
investment strategy is 


Vw — C(p). (8) 
The equilibrium prize structure selects W, and W, to maximize (8), or 
[V —C'(w)](Ou/OW;) = 0, 1 = 1,2. (9) 


The marginal cost of investment equals its marginal social return, V = 
C’(w), in the tournament as well as the piece rate. Therefore, compet- 
itive tournaments, like piece rates, are efficient and both result in 
exactly the same allocation of resources. 

Some further manipulation of the equilibrium conditions yields an 
interesting interpretation in terms of the theory of agency (see Ross 
Bake Becker and Stigler 1974; Harris and Raviv 1978; and Lazear 
1979): 


W, =Vyu + C'(u)/2¢(0) = Vu + V/2g(0) 
W. =Vyu — C'(w)/2g(0) = Ve — V/2g(0). 


The second equality follows from V = C'(w). Now think of the term 
C'(m)/2g(0) = V/2g(0) in (10) as an entrance fee or bond that is posted 
by each player. The winning and losing prizes pay off the expected 
marginal value product plus or minus the entrance fee. That is, the 
players receive their expected product combined with a fair winner- 
take-all gamble over the total entrance fees or bonds. The appropriate 
social investment incentives are given by each contestant’s attempt to 
win the gamble. This contrasts with the main agency result, where the 
bond is returned to each worker after a satisfactory performance has 


(10) 
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been observed. There the incentive mechanism works through the 
employee’s attempts to work hard enough to recoup his own bond. 
Here it works through the attempts to win the gamble. 

Comparative statics for this problem all follow from (9) and (10) 
once a distribution is specified. For example, if € is normal with 
variance o”, then g(0) = Yoo V 7. It follows from (10) that the optimal 
spread varies directly with V and a”. While several other interesting 
observations can be made of this sort, we note a somewhat different 
but important practical implication of this general scheme. Even 
though the optimal prize structure determines expected marginal 
product through its effect on worker choice of uw and the zero-profit 
condition (7) implies that expected prizes equal expected produc- 
tivity, nevertheless actual realized earnings definitely do not equal 
productivity in either an ex ante or ex post sense. Consider ex ante 
first. Since 4; = Mx = &, expected products are equal. Since W, > W, is 
required to induce any investment, the payment that receives never 
equals the payment that & receives. It is impossible that the prize is 
equal to ex ante product, because ex ante products are equal. Nor do 
wages equal ex post products. Actual product is Vq rather than Vp. 
But gq is a random variable, the value of which is not known until after 
the game is played, while W, and W, are fixed in advance. Only under 
the rarest coincidence would W, = Vq; and W, = Vqx. 

Consider the salary structure for executives. It appears as though 
the salary of, say, the vice-president of a particular corporation is 
substantially below that of the president of the same corporation. Yet 
presidents are often chosen from the ranks of vice-presidents. On the 
day that a given individual is promoted from vice-president to presi- 
dent, his salary may triple. It is difficult to argue that his skills have 
tripled in that 1-day period, presenting difficulties for standard 
theory where supply factors should keep wages in those two occupa- 
tions approximately equal. It is not a puzzle, however, when inter- 
preted in the context of a prize. The president of a corporation is 
viewed as the winner of a contest in which he receives the higher 
prize, W,. His wage is settled on not necessarily because it reflects his 
current productivity as president, but rather because it induces that 
individual and all other individuals to perform appropriately when 
they are in more junior positions. This interpretation suggests that 
presidents of large corporations do not necessarily earn high wages 
because they are more productive as presidents but because this 
particular type of payment structure makes them more productive 
over their entire working lives. A contest provides the proper incen- 
tives for skill acquisition prior to coming into the position. 

3 If ¢ is a fixed effect, there is additional information from knowing the identity of 


winners and losers. The expected productivity of a winner is w + Efe; |q5 > x), while 
that of a loser is 4 + E(e;|q; < 9x). In a one-period contest there is no possibility of 
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Though tournaments and piece rates are substantially different in- 
stitutions for creating incentives, we have demonstrated the surpris- 
ing result that both achieve the Pareto optimal allocation of resources 
when workers are risk neutral. In fact other schemes also achieve this 
allocation. For example, instead of playing against an opponent, a 
worker might be compared with a fixed standard q, with one payment 
awarded if output falls anywhere below q and another, higher, pay- 
ment awarded if output falls anywhere above standard. Attempting to 
beat the standard has the same incentive effects as attempting to beat 
another player. Using the same methods as above, it is not difficult to 
show that there are spread-standard combinations that induce Pareto 
optimum investments. Since all these schemes involve the same in- 
vestment policy, and since average payout by the firm equals average 
product for all of them, they all yield the same expected rewards and, 
therefore, the same expected utility to workers.‘ 

In spite of the apparent equality of these schemes in terms of the 
preferences of risk-neutral workers, considerations of differential 
costs of information and measurement may serve to break these ties in 
practical situations. The essential point follows from the theory of 
measurement (Stevens 1968) that a cardinal scale is based on an 
underlying ordering of objects or an ordinal scale. In that sense, an 
ordinal scale is “weaker” and has fewer requirements than a cardinal 
scale. If it is less costly to observe rank than an individual’s level of 
output, then tournaments dominate piece rates and standards. On 
the other hand, occupations for which output is easily observed save 
resources by using the piece rate or standard, or some combination, 
and avoid the necessity of making direct comparisons with others as 
the tournament requires. Salesmen, whose output level is easily ob- 
served, typically are paid by piece rates, whereas corporate executives, 
whose output is more difficult to observe, engage in contests. 

In a modern, complex business organization, a person’s produc- 
tivity as chief executive officer is measured by his effect on the 
profitability of the whole enterprise. Yet the costs of measurement for 


taking advantage of this information. However, in a sequential contest with no firm- 
specific capital, the information would be valuable and would constrain subsequent 
wage payments in successive rounds through competition from other firms. It is not 
difficult to show that this does not affect the general nature of the bond-gamble 
solution. Alternatively, if the investment has firm-specific elements or firms adopt 
policies that bind workers to it (as in Lazear 1979), these restrictions do not necessarily 
apply. 

* The level of the standard is indeterminate, since for any@ a corresponding spread 
can be chosen to achieve the optimal investment. This is also true of contests among 
more than two players. With N contestants, the prizes of N — 2 of them are indetermi- 
nate. When risk neutrality is dropped, the indeterminacy vanishes in both cases. 
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each conceivable candidate are prohibitively expensive. Instead, it 
might be said that those in the running are “tested” by assessments of 
performance at lower positions. Realizations from such tests are sam- 
ple statistics in these assessments, in much the same way that grades 
are assigned in a college classroom and IQ scores are determined. 
The point is that such tests are inherently ordinal in nature, even 
though the profitability of the enterprise is metered by a well-defined, 
cardinal ratio scale. It is in situations such as this that the conditions 
seem ripe for tournaments to be the dominant incentive contract 
institution. 

Notice in this connection that the basic prize and piece-rate struc- 
tures survive a broad class of revenue functions other than summable 
ones. Even if the production function of the firm includes compli- 
cated interactions involving complementarity or substitution among 
individual outputs, there exists the possibility of paying workers 
either on the basis of individual performance or by rank order. The 
revenue function itself can even involve rank-order considerations, 
and both possibilities still exist. For example, spectators at a horse race 
generally are interested in the speed of the winning horse and the 
closeness of the contest. Then the firm’s (track) revenue function 
depends on the first few order statistics; yet the horses could be paid 
on the basis of their speed rather than on the basis of win, place, and 
show positions. Both methods would induce them to run fast.° 

There has been very little treatment of the problem of tournament 
prize structure and incentives in the literature. Little else but the 
well-known paper by Friedman (1953) based on Friedman-Savage 
preferences for lotteries exists in economics. In the statistics literature 
there is an early paper by Galton (1902) that is worthy of brief 
discussion. Galton inquired into the ratio of first- and second-place 
prize money in a race of n contestants, assuming the prizes were 
divided in the following ratio: 


W/W, = (Q1 — Qs)/(Qz — Qs). 
Here Q, is the expected value of the first- (fastest) order statistic, etc. 
While a moment’s reflection suggests this criterion to be roughly re- 


lated to marginal productivity, Galton proposed it on strictly a priori 
grounds. He went on to show the remarkable result that the ratio above 


5 The reader is reminded that throughout this section and the next workers are 
identical a priori and differ only ex post through the realization of e. In the real world, 
where there is population heterogeneity, market participants are sorted into different 
contests. There players (and horses, for that matter) who are known to be of higher 
quality ex ante may play in games with higher stakes. If it can be accomplished, the 
sorting is by anticipated marginal products. In that sense, pay differences among 
contestants of known quality resemble the effect of a “piece rate.” These issues are 
more thoroughly discussed below. 
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is approximately 3 when the parent distribution of speed is normal. 
Hence, this criterion results in a highly skewed prize structure. From 
what we know today about the characteristic skew of extreme value 
distributions, a skewed reward structure based on order statistics is 
less surprising for virtually any parent distribution. In the more 
modern statistical literature, the method of paired comparisons has 
tournament-like features. Samples from different populations are 
compared pairwise, and the object is to choose the one with the 
largest mean. Comparing all samples to each other is like a round- 
robin tournament. An alternative design is a knockout tournament 
with single or double elimination. The latter requires fewer samples 
and is therefore cheaper, but does not generate as much information 
as the round robin (David 1963; Gibbons, Olkin, and Sobel 1977). 

Galton’s original work and the more modern developments it has 
given rise to are not helpful to us; they deal with samples from fixed 
populations, so the reward structure is irrelevant for resource alloca- 
tion. The problem we have treated here is that of choosing the reward 
structure to provide the proper incentive and elicit the socially proper 
distributions. 


III. Optimal Compensation with Risk Aversion 


All compensation systems can be viewed as schemes which transform 
the distribution of productivity to a distribution of earnings. A piece 
rate is a linear transformation of output, so the distribution of income 
is the same apart from a change in location and scale. A tournament is 
a highly nonlinear transformation: It converts the continuous dis- 
tribution of productivity into a discrete, binomial distribution of in- 
come. When workers are risk neutral, both schemes yield identical 
investments and expected utility because their first moments are the 
same. In this section, it is shown that with risk aversion one method or 
the other usually yields higher expected utility, because the interac- 
tion between insurance and action implies substantially different first 
and second moments of the income distribution in the two cases.® 
We have been unable to completely characterize the conditions 
under which piece rates dominate rank-order tournaments and vice 
versa, but we show some examples here. Truncation offered by prizes 
implies more control of extreme values than piece rates but less 
control of the middle of the distribution. Different utility functions 


* One might think that risks could be pooled among groups of workers through 
sharing agreements, but that is false because of moral hazard. A worker would never 
agree to share prizes since doing so would result in 4 = 0, and consequently E(q; + qx) 
= 0 and bankruptcy for the firm. As a result, firms offering tournaments or piece rates 
in the pure sense yield higher expected utility than the sharing arrangement. 
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weight one aspect more than the other so that tournaments can 
actually dominate piece rates. 


Optimum Linear Piece Rate™ 


The piece-rate scheme analyzed pays workers a guarantee, J, plus an 
incentive, rq, where r is the piece rate per unit of output. The problem 
for the firm is to pick an 7, 7] combination that maximizes workers’ 
expected utility 


max [E(U) = max J U(y)6(y)dy], a) 


where 


ye lrg Cp) 
(12) 
=I+rwt+re—C(p) 


and 6(y) is the pdf of y. 
The worker’s problem is to choose yz to maximize expected utility 
given I and r. If € ~ f(e), the worker’s problem is 


max E(U) = Ur +ru+re —C(p)|f(ede. 
bu 
The first-order condition is 


OE (U) 


Ail = f(U'O)Ilr — C'(w)|f(ede = 0, 


which conveniently factors so that 
r=C'(p). (13) 


Condition (13) is identical to the risk-neutral case, because e€ is inde- 
pendent of investment effort, m. 

Assuming risk-neutral employers, Vz is expected revenue from a 
worker and / + rp is expected wage payments. Therefore, the zero- 
profit market constraint is 


Vu =I+rp. (14) 


Solving (14) for J and substituting into (12), the optimum contract 
maximizes 


fuWp(r) + re — C[p(r) ]} f(e)de 
with respect tor, where ws = (7) satisfies (13). After simplification the 
7 The following is similar to a problem analyzed by Stiglitz (1975). A linear piece-rate 


structure is a simplification. A more general structure would allow for nonlinear piece 
rates (see Mirrlees 1976). 
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marginal condition is 
[V- Cunl Se EU" eel = (15) 


Since risk aversion implies EeU’ < 0, (15) shows that V > C'(y) in the 
optimum contract for risk-averse workers. This underinvestment is 
the moral hazard resulting from insurance J > 0 and r < V implied by 
(15). 

Using familiar Taylor series approximations to the utility function 
and a normal density for e, the optimum is approximated by 


V 
= Ce 16 
oa Feat i) 
and 
V2o2 
ee Lid 
Oy + sC"a2)?’ Ue. 
where s = —U"/U' evaluated at mean income is the measure of abso- 


lute risk aversion. Investment increases (see [16]) in V and decreases 
in s,C”, and o?, because all these changes imply similar changes in the 
marginal piece rate r which influences investment through condition 
(13). The same changes in V, s, and C” have corresponding effects on 
the variance of income (see [17]), but an increase in o? actually 
reduces variance, if a? is large, because it reduces r and increases J.8 


Optimum Prize Structure 
The worker's expected utility in a two-player game is 
E(U) = P{U[W, — C(p*)]} + (1 — P){U[W, — C(p*)]}, (18) 
where * denotes the outcome of the contest rather than the piece-rate 
scheme. The optimum prize structure is the solution to 
max (E(U*) = max {P - U[W, — C(w*)] + (1 — P) - U[W, — C(u*)}}) 
(19) 
subject to the zero-profit constraint 
Vu* = PW, + (1 —P)W,. (20) 


The worker selects x* to satisfy OF (U)/Ou* = 0. Since cost functions 
are the same and e; and e; are i.i.d., the Nash solution implies 1; = wx 


® Furthermore, r = V/(1 + sC"o?) andI = sV?o?/(1 + sC"o?)?, so that r = V andJ = Oin 
the case of risk neutrality (s = 0). All these approximations use first-order expansions 
for terms in U’(-) and second-order expansions for terms in U(-). The same is true of 
the approximations below for the tournament. 
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and P = % as before. Then the worker’s investment behavior sim- 
plifies to 


Cus) = (21) 
where U(r) = U[W, — C(p*)] and U'(r) = U'[W, — C(p*)] for = 1, 2. 
Equation (21) implies 
B* = p*(W,W2), (22) 
and the optimum contract (W,,W,) maximizes 
E(U*) = % U[W, — C(p*)] + 4 ULW, — C(p*)] (23) 


subject to (20), with P = 14, and (22). Increasing marginal cost of 
investment and risk aversion guarantees a unique maximum to (23) 
when a Nash solution exists. Again, assuming a normal density for e, 
second-order approximations yield 


V 
eo (/-1 
Bee ON acre (24) 
and 
Etat Os em. 
oe einai 2 S 
where 


y* {= W, — C(w*) if q5 > Ge 
= W, — C(p*) if 5 < 4x 
and e; ~ N(0,07),e, ~ N(0,07), and cov (ey,€,) —.0,. Lhe comparative 
statics of (24) and (25) are similar to the piece rate (16) and (17) and 
need not be repeated. 


Comparisons 


Equations (16) and (24) indicate that investment and expected in- 
come” are lower for the contest than for the piece rate at given values 
of s. Moreover, for values of a? in excess of UsC'V 7, the variance of 
income in the tournament is smaller than for the piece rate. This 
would seem to suggest that contests provide a crude form of insur- 
ance when the variance of chance is large enough, but the problem is 
significantly more complicated than that because there is no separa- 
tion between tastes and opportunities in this problem: The optimum 


9 Futhermore, C’(u*) = g(0)(W, — W,), so the spread is still crucial for investment 
incentives, as in the risk-neutral case. 

1 Since y = Vp — C(w), and since p is below the wealth-maximizing level of 4 when 
workers are risk averse, lower p implies lower y because revenue falls by more than cost. 
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mean and variance themselves depend on utility-function parameters. 
Thus, for example, for the constant, absolute risk-aversion utility 
function U = —e7*4/s, the insurance provided by the contest is in- 
sufficient to compensate for its smaller mean: It can be shown that the 
expected indirect utility of the optimal piece rate exceeds that of the 
optimal tournament for all values of o?, at least with normal distribu- 
tions and quadratic investment-cost functions. However, when there 
is declining absolute risk aversion, we have examples where the con- 
test dominates the piece rate. 

Illustrative calculations are shown in table 1 using the utility func- 
tion U = ay*, which exhibits constant relative but declining absolute 
risk aversion, s(y) = (1 — a)/y. Again quadratic costs and normal errors 
are assumed. However, this utility function is defined for positive 
incomes only, so an amount of nonlabor income yg is assigned to the 
worker to avoid a major approximation error of the normal, which 
admits negative incomes (i.e., the possibility of losses). 

Table 1 shows that when yy) = 100 so that s = .005, the contest is 
preferred until o? > 3. However, if yy = 25 so that s = .020, the 
contest is only preferred for o? < .2. The intuition is that piece rates 
concentrate the mass of the income distribution near the mean, while 
contests place 50 percent of the weight at one value significantly below 
the mean and the other value significantly above. Strongly risk-averse 
workers seem to dislike the binomial nature of this distribution when 
o” is high because it concentrates too much of the mass at low levels of 
utility. However, when o? is small, the contest which truncates the tails 


TABLE 1] 


CONSTANT RELATIVE Risk AVERSION 








o? ra yw E(U) E(U*) 





Yo = 100; s(y9) = .005 











ll .9995 .9984 5.012155 5.012465 
5 .9975 .9922 5.012150 5.012445 

1 .9950 .9846 5.012100 5.012295 
3 .9852 -9552 5.011940 5.011925 
6 .9710 .9142 5.011800 5.011415 
12 -9436 .8420 5.011420 5.010515 

Yo = 25; (yo) = .020 

all .9980 .9938 2.524665 2.524725 

22 .9960 .9878 2.524616 2.524575 

] .9807 .9419 2.524237 2.523437 
12 .8094 574] 2.519930 2.514282 





Note.—U = ay; y = yo + I + rq — C() for piece rate; y = yo + W, — C(x) for contest (i = 1, 2);a= .5,V= 1,C(p) 
= w/2: 0. 
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of the income distribution associated with a linear piece rate has 
higher value. 


Income Distributions 


While it is not possible to make a general argument based on an 
example, table 1 suggests that persons with more endowed income 
and smaller absolute risk aversion are more likely to prefer contests, 
and those with low levels of endowed wealth and larger absolute risk 
aversion are more likely to prefer piece rates. Consider a situation in 
which all persons have the same utility function, such as the one in 
table 1, and face the same costs and luck distribution, the only dif- 
ference being the fact that some workers have larger endowed in- 
comes than others. If this difference is large enough, it can be optimal 
to pay piece rates to those with small values of endowed income and to 
pay prizes to those with large values. Individuals will self-select the 
payment scheme in accordance with their wealth. The distribution of 
earnings among those selecting the piece-rate jobs is normal with 
mean Vy and variance 1’a”. It is binomial with mean V p* and vari- 
ance (AW)?/4 for those who enter tournaments. Note that yz and p* 
depend upon s(y), which is smaller for workers who select contests, 
and it can turn out as it does in table 1 that expected income is larger 
in the contest than in the piece rate; for example, if ¢? = 1 then the 
rich prefer contests (5.012295 > 5.012100) and the poor prefer piece 
rates (2.524237 > 2.523437), but u* = .9846 exceeds x = .9807. This 
situation is shown in figure 1. 

The overall distribution is the sum of a binomial and a normal with 
lower mean, weighted by the number of individuals in each occupa- 
tion (see fig. 1). It is positively skewed because V u* > Vw. Note also 
that the distribution of wage income will be less skewed than that of 
total income. The reason is that yy) and mean-wage income are posi- 
tively correlated because the likelihood of choosing a contest increases 
with yo. These implications conform to the standard findings on the 
distribution of income in an economy. 

This example is interesting because it is very closely related to some 
early results of Friedman (1953), who studied how alternative social 
arrangements can produce income distributions that cater to workers’ 
risk preferences. He showed that the Friedman-Savage utility func- 
tion leads to a two-class distribution. Persons in the risk-averse region 
are assigned to occupations in which income follows productivity, 
while persons in the risk-preferring region buy lottery tickets in very 
risky occupations in which few win very large prizes. The overall 
distribution is the sum of these two and exhibits characteristic skew. 
The Friedman-Savage utility function implies that a person’s risk 
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fly) 


Yo+ Vi W,+Y6 ygtVqu" Wit+yo Income 
Fic. 1 


preferences depend on the part of his wealth that is not at risk. 
Therefore, Friedman’s assignment of people to jobs really follows 
endowed wealth (yo), just as in our example. However, our framework 
offers two improvements. First, the problem of incentives 1s directly 
incorporated into the formulation of the optimum policy. Second, 
workers in this model are risk averse for all values of incomes, but 
even so gambles can be the optimal policy. 


Error Structure 


Relative costs of measurement are still important in choosing among 
incentive schemes, but the error structure plays additional roles when 
workers are risk averse. Suppose the output estimator for worker? in 
activity 7 is Gir = dir + P; + Vir, Where v;, is random error and p, is an 
error that is specific to activity r but common to all workers within that 
activity. In the piece rate the common error p adds noise which 
risk-averse workers dislike, while the common noise drops out of a 
rank-order comparison because it affects both contestants similarly. 
That is, the relevant variance for the contest is 202, while that for the 
piece rate is 03 + o3. It is evident that this can tip the balance in favor 
of tournaments if o3 is large enough and/or workers are sufficiently 
risk averse. 

The common error p bears two interesting interpretations. One is 
activity-specific measurement error. For example, j and k may have 
the same supervisor whose biased assessments affect all workers simi- 
larly. This is similar to monitoring all workers by a mechanical count- 
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ing device that might run too fast or too slow in any given trial. The 
other interpretation of p is true random variation that affects the 
enterprise as a whole. For example, suppose all firms produce with 
the same technology, but that in a given period some firms do better 
or worse than others. Then risk-averse workers prefer not to have 
their incomes vary with conditions facing the firm as a whole, and 
wages based on a contest eliminate this kind of variation. Without its 
elimination there would be excessive losses due to moral hazard. 

It must be pointed out that, in the absence of measurement error, 
using a contest against a fixed standard q discussed above has lower 
variance than playing against an opponent. As shown in Section II, 
the relevant variance in a contest is that of € = e, — €;, which has 
variance 20° against an opponent and only o? against a standard 
(since the standard is invariant, e, = 0). Consequently, we might 
expect risk-averse workers to prefer absolute standards." Again, 
however, the crucial issue is the costs of measurement and the error 
structure. For the complex attributes required for managerial posi- 
tions, it is difficult to observe output and therefore difficult to com- 
pare to an absolute standard. Insofar as samples and tests are neces- 
sary, it bears repeating that these are inherently ordinal in nature. But 
this leads us back to the problem of common error, where it is often 
impossible to know whether a person’s output is satisfactory without 
comparisons to other persons. Further, when there are changing 
production circumstances in the firm as a whole, it is difficult to know 
whether the person failed to meet the standard because of insufficient 
investment or because the firm was generally experiencing bad times, 
a problem of measuring “value added.” Risk-averse workers increase 
utility by competing against an opponent and eliminating this kind of 
firm effect. 


IV. Heterogeneous Contestants 


Workers are not sprinkled randomly among firms but rather seem to 
be sorted by ability levels. One explanation for this has to do with 
complementarities in production. But even in the absence of com- 
plementarities, sorting may be an integral part of optimal labor- 
contract arrangements. Informational considerations imply that 


1 Playing against a standard is like Mirrlees’s (1976) notion of an “instruction.” It is 
clear that using standards as well as piece rates must be superior to using one alone. That 
scheme would allow workers to be paid J if gq < g and J, + rq for q 2 q. This is important 
because it truncates the possibilities when Vq < 0. Given the technology, it is possible 
that very large negative values of output can occur, and since it is impossible to always 
tax workers the full extent of this loss, some form of truncation is desirable. A contest is 
an alternative way to control the tails of this distribution. 
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compensation methods may affect the allocation of worker types to 
firms. Therefore, this section returns to the case of risk neutrality and 
analyzes tournament structures when investment costs differ among 
persons. Two types of persons are assumed, a’s and b’s, with marginal 
costs of the a’s being smaller than those of the b’s: Ca(“) < C,(#) for all 
p. The distribution of disturbances f(€) is assumed to be the same for 
both groups. Many of the following results continue to hold, with 
usually obvious modification of the arguments, if the a’s and b’s draw 
from different distributions. The following section addresses the 
question of self-selection when workers know their identities but firms 
do not. The next section discusses handicapping schemes when all 
cost-function differences can be observed by all parties. 


Adverse Selection 


Suppose that each person knows to which class he belongs but that 
this information is not available to anyone else. The principal result is 
that the a’s and b’s do not self-sort into their own “leagues.” Instead, 
all workers prefer to work in firms with the best workers (the major 
leagues). Furthermore, there is no pure price-rationing mechanism 
that induces Pareto optimal self-selection. But mixed play is 
inefficient because it cannot sustain the proper investment strategies. 
Therefore, tournament structures naturally require credentials and 
other nonprice signals to differentiate people and assign them to the 
appropriate contest. Firms select their employees based on such in- 
formation as past performances, and some are not permitted to com- 
pete. 

The proof of adverse selection consists of two parts. First we show 
that players do not self-sort into a leagues and 6 leagues. Second, we 
show that the resulting mixed leagues are inefficient. 

1. Players do not self-sort.—Assume leagues are separated and con- 
sider the expected revenue R; generated by playing in league i = a, b 
with an arbitrary investment level w. Then 


Ri(u) = Ws + (Wi — W3)P*, i =a,), (26) 


where (W{,W3) is the prize money, and P* is the probability of winning 
in league 7. Recall that P' depends on the individual’s level of invest- 
ment and that of his rivals. Therefore, P? = G(u — wx) and P® = G(w 
— Ms), where yz is the existing players’ investments in the a league, 
where V = C;(u#), and similarly for p#. Recalling from (6) and (9) that 
W;, — Wi = V/g(0) and from (10) that Wi = Vu — V/2¢(0), equation 
(26) becomes 


LOTR G GE tne | (27) 


R; ad V i 
() be z(0) 
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Note that Rj(w) = Vu* when pw = p¥ and that dRj/dp = Ri(p) = Ve(w 
— pi*)/g(0) > 0. Since Ci(u*) = V and C}(u#) = V, then pi < p* so that 
Ri(ué) > Ri(u#). Furthermore, Rj[w — (u* — L*)] = Ri(w). Therefore, 
R,(“) is a pure displacement of R,(u). Since R}(u) = V for pw = pe and 
R;j(u) < V elsewhere, and since R;() is increasing, the revenue func- 
tions never cross. So R,() lies to the southwest of Taipei (see fig. 2): 
Therefore, independent of cost curves, it is always better to play in the 
a league than the b league: Workers will not self-select. 

2. Mixed contests are inefficient.—Suppose the proportions of a’s and 
b’s in the population are @ and (1 — a), respectively. If pairings among 
a’s and b’s are random, then expected utility of a player of type i is 


W, + [aPi, + (1 — a)PiW, — W,) — Cu), 


where (W,,W.) is the prize money in mixed play and P} is the proba- 
bility that a player of type 2 defeats a player of type j. The first-order 
condition for investment of type 7 in this game is 








5 OP?, leer} oP 


Db) = = 9 
a dan | A ~ Wa) = Ch. 
A development similar to Section II implies equilibrium reaction 
functions 


Lag(0) + (1 — @)g(a — Bo) (Wi — We) = Cia) 


R(u) 
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for a’s and 
[agp — a) + (1 — a)g(0)(W, — We) = Cro) 


for b’s. If the solution is efficient, then Cj(u,) = V = Ci(pa), which 
implies 


ag(0) + (1 — a)g(@a — Be) = Og > — Ma) + (1 — a) g(0). 


Since g is symmetric and nonuniform, this condition can hold only if @ 
= 4. Therefore, except in that very special case, mixed contests yield 
inefficient investment: One type of player overinvests and the other 
underinvests depending upon whether or not a 2 2. 

We conclude that a pure price system cannot sustain an efficient 
competitive equilibrium in the presence of population heterogeneity 
with asymmetric information. Markets can be separated, but only at a 
cost. Consider, for example, the case where a’s want to prevent b’s 
from contaminating their league. By making the, spread: W/o. — V3, 
sufficiently large, R,() becomes steeper than R,(w) in figure 2 and 
crosses it so that the envelope covers R,() at low values of w and 
R,() at high values. Then, for some high levels of mw, it is more 
profitable to play in the a league and, for low levels of yw, the b league 
is preferable. Individuals may self-sort, but the cost is that a’s overin- 
vest. The result is akin to that of Akerlof (1976) and to those of 
Spence (1973), Riley (1975), Rothschild and Stiglitz (1976), and Wil- 
son (1977). As they show, a separating equilibrium need not exist, 
but, even if it does, that equilibrium may be inferior to a nonseparat- 
ing equilibrium. 

The obvious practical resolution of these difficulties is the use of 
nonprice rationing and certification to sort people into the appro- 
priate leagues based on past performance. Similarly, firms use non- 
price factors to allocate jobs among applicants. The rules for allocat- 
ing those jobs may be important for at least two reasons that we can 
only briefly describe here. 

First, sorting workers of different skill levels into appropriate posi- 
tions within a hierarchy may be beneficial. In this paper, production is 
additive, so it does not matter who works with whom. To the extent 
that the production technology is somewhat more complicated, sort- 
ing may well be crucial. A series of pairwise, sequential contests may 
efficiently perform that function. Suppose that gy = wi + 6; + nit, 
where 6; is an unobserved ability component for player7 and 7 is white 
noise. Suppose it is efficient for the individual with the highest 6 to be 
the chief executive. There will be a tendency to have winners play 
winners because 


E(8;| qi: > Ges E(6n| Gn > Ged) 
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in the first round. A sequential elimination tournament may be a 
cost-efficient way to select the best person. 

Second, workers may not know precisely their own abilities or cost 
functions. A worker who is ignorant about his cost function values 
information before selecting a level of investment expenditure. 
Therefore, firms may offer “tryouts” to provide information about 
optimal investment strategies. In fact, one can imagine the existence 
of firms which specialize in running contests among young 
workers—the minor leagues—which provide information to be used 
when and if the workers opt to increase the stakes and enter a bigger 
league. 

These issues point up an important difference between piece rates 
and contests. In the pure heterogeneous case, where information is 
asymmetric and workers are risk neutral, a piece rate always yields an 
efficient solution, namely, V = C(t) = Cj(pp). However, once slot- 
ting of workers is important because of complementarities in produc- 
tion, or if it is desirable for workers to gain information about their 
type, it is no longer obvious that a series of sequential contests does 
not result in a superior allocation of resources. 


Handicap Systems 


This section moves to the opposite extreme of the previous discussion 
and assumes that the identities of each type of player are known to 
everyone. Competitive handicaps yield efficient mixed contests. 

Consider again two types a and 6 now known to everyone. Prize 
structures in a-a and b-b tournaments satisfying (11) and (12) are 
efficient, but those conditions are not optimal in mixed a-b play. 
Denote the socially optimal levels of investment by we and yp}, their 
difference by Ap, and the prizes in a mixed league by W, and W,. Let 
h be the handicap awarded to the inferior player b. Then the Nash 
solution in the a-b tournament satisfies 


g(a — My — h)AW = Ci (pa) (28) 
and 
£ (Ha — My — h)AW = Cy(my). 


(The second condition in [28] follows from symmetry of g[€].) Since 
the efficient investment criterion is V = Ci(yux) = C;(u*), independent 
of pairings, the optimum spread in a mixed match must be 


AW = Vig(Au — hy). (29) 


From (28), condition (29) insures the proper investments by both 
contestants. The spread is larger in mixed than pure contests unless a 
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gives 6 the full handicap h = px — pi. Otherwise, the appropriate 
spread is a decreasing function of h. ae W, and W, must also satisfy 
the zero-profit constraint W,+W,=V - (ux + ux) independent of h 
since the spread is always adjusted to ae investments ux and ps. 
The gain to an a from playing a 6 with handicap h, rather than 
another a with no handicap, is the difference in expected prizes: 


Yalh) = PW, + (1 — P)W, — Ca(ue) — (WY + W3)/2 — Ca(me)] (30) 
= PW,+(1—P)W, — (W% + W3)/2, 


where y,(A) is the gain toa and P =G(Ap — h) isthe probability that a 
wins the mixed match. The corresponding expression for 6 is 


yp(h) = (1 — P)W, + PW, — (W® + W3)/2. (31) 


The zero-profit constraints in a-a, a-b, and b-b require that y,(h) + 
y,(h) = 0 for all admissible h. The gain of playing mixed matches to a 
is completely offset by the loss to b and vice versa. 

If C,() is not greatly different from C,(m), then Au = wx — pF is 
small and P = % + [g(Ap — h)](Aw — h). This approximation and 
the zero-profit constraint reduce (30) to 

ee ae - hj, (32) 
The expression for y,(h) is the same, except its sign is reversed, so the 
gain to a decreases inh, and the gain to b increases in h. Therefore, h* 
= Ap/2 is the competitive handicap, since it implies yq(h*) = y,(h*) = 
0. If the actual handicap is less than h*, then y, is positive and a’s 
prefer to play in mixed contests rather than with their own type, while 
b’s prefer to play with b’s only. The opposite is true if h > h*. 

A two-player game is said to be fair when the players are handi- 
capped to equalize the medians. The competitive handicap does not 
result in a fair game, since h* = Ap/2 < Aw. The a’s are given a 
competitive edge in equilibrium, because they contribute more to total 
output in mixed matches than the b’s do. This same result holds if €, 
has a different variance than €,, but it may be sensitive to the assump- 
tion of statistical independence and output additivity. 

Alternatively, h can be constrained to be zero. In this case, different 
wage schedules would clear the market. Since Ya(0) = = ~y(0) = B, 
paying W, — B and W, — B toa’s, while paying W, + B, W2 + B to b’s, 
leaves the spread and, therefore, the investments unaltered. It is easy 
to verify that a’s and 6’s are still indifferent between mixed and pure 
contests, because expected returns are equal between segregated and 
integrated contests for each type of player. With no handicaps, the 
market-clearing prizes available to a’s in the mixed contest are lower 
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than those faced by 0’s. Still, expected wages are higher for a’s than b’s 
in the mixed contest, because their probability of winning is larger. 
The ’s are given a superior schedule in the mixed contest as an 
equalizing difference for having to compete against superior oppo- 
nents. This yields the surprising conclusion that reverse discrimina- 
tion, where the less able are given a head start or rewarded more 
lucratively if they happen to accomplish the unlikely and win the 
contest, can be consistent with efficient incentive mechanisms and 
might be observed in a competitive labor market. 


V. Summary and Conclusions 


This paper analyzes an alternative to compensation based on the level 
of individual output. Under certain conditions, a scheme which re- 
wards rank yields an allocation of resources identical to that gener- 
ated by the efficient piece rate. Compensating workers on the basis of 
their relative position in the firm can produce the same incentive 
structure for risk-neutral workers as does the optimal piece rate. It 
might be less costly, however, to observe relative position than to 
measure the level of each worker’s output directly. This results in 
paying salaries which resemble prizes: wages which differ from 
realized marginal products. 

When risk aversion is introduced, the prize salary scheme no longer 
duplicates the allocation of resources induced by the optimal piece 
rate. Depending on the utility function and on the amount of luck 
involved, one scheme is preferred to the other. An advantage of a 
contest is that it eliminates income variation which is caused by factors 
common to workers of a given firm. 

Finally, we allow workers to be heterogeneous. This complication 
adds an important result: Competitive contests do not automatically 
sort workers in ways that yield an efficient allocation of resources 
when information is asymmetric. In particular, low-quality workers 
attempt to contaminate firms composed of high-quality workers, even 
if there are no complementarities in production. Contamination re- 
sults in a general breakdown of the efficient solution if low-quality 
workers are not prevented from entering. However, when player 
types are known to all, there exists a competitive handicapping 
scheme which allows all types to work efficiently within the same firm. 
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An Exploration in the Theory 
of Exchange-Rate Regimes 





Elhanan Helpman 
Tel-Aviv University 


Three exchange-rate regimes—a float, a one-sided peg, and a 
cooperative peg—are evaluated and compared in terms of efficiency 
and welfare levels. The framework of analysis embodies country- 
specific monies, with the money of each country being used to 
transact in its commodity markets and its currency-denominated 
bonds. Welfare levels depend only on consumption levels. In the 
presence of perfect foresight all equilibrium allocations are Pareto 
efficient. In a floating exchange-rate regime the perfect foresight 
equilibrium allocation coincides with an equilibrium of a costless 
barter economy. The same result holds in a one-sided peg if the 
pegging country’s exchange-rate stabilizing authority breaks even 
over time. In a cooperative peg regime there is a different equilib- 
rium allocation for each combination of exchange-rate levels and 
monetary policies. Problems of policy coordination and conflicts in 
desired monetary policies are discussed. 


I. Introduction and Summary 


The literature on international economics lacks systematic compari- 
sons of welfare levels that are attained in different exchange-rate 
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systems. Nevertheless, this type of information is essential for a well- 
founded opinion about the relative desirability of alternative 
exchange-rate regimes, like fixed as opposed to floating. The need for 
research in this area can hardly be overemphasized, and it is more 
surprising that only very recently have there been attempts to provide 
such comparisons. 

It is clear that in any analysis of exchange-rate systems the 
specification of the role of money will play a major role because, after 
all, an exchange rate is a relative price of two monies. It is, however, 
unsatisfactory to model money as an asset which serves only as a store 
of value, as is frequently done, because in this case money is just 
another asset without intrinsic features. An explicit modeling of these 
roles is required for a proper evaluation of the relative desirability of 
alternative exchange-rate regimes. 

The present investigation is based on a specification of the role of 
money. Money can be used as a store of value, but it also has to be 
used in every market transaction. Purchases of commodities require 
payments in money, lending requires money to be transferred from 
the lender to the borrower, and debt repayments require money to be 
transferred from the debtor to the creditor. This is, of course, an 
application of the constraint transactions approach to the demand for 
money that was suggested by Clower (1967) (for a recent application 
see Lucas [1980a]). Most important, since I deal with more than one 
country—each country having its own money—it is assumed that for 
purchases of a particular country’s goods one has to pay with that 
country’s money. (This specification is also adopted by Stockman 
[1980], Helpman and Razin [1980], and Lucas [1980b].) Moreover, 
particular currency-denominated loans are given by providing the 
borrower with this currency. Debt repayments are done with the 
currency in which the debt is denominated. This specification of 
the transactions role of money introduces natural liquidity constraints 
into the economic system, and it clearly distinguishes money from 
other financial assets. 

My framework of analysis, which incorporates money in the 
above-described way into an intertemporal model in which utility 
levels depend only on consumption levels, is presented in Section II. 
In the remainder of the paper the model is used to investigate three 
types of exchange-rate regimes: a floating exchange-rate regime and 
two types of fixed exchange-rate regimes. I discuss the implications of 
these regimes about global efficiency and about the distribution of 
welfare, as well as the determination of financial variables like prices, 
exchange rates, and interest rates. The emphasis is on financial as- 
pects of exchange-rate regimes. 

The fixed exchange-rate regimes that are considered are a one- 
sided peg and a cooperative peg. The one-sided peg is a regime in 
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which one country pegs its exchange rate vis-a-vis another currency, 
and the foreign exchange authority of the pegging country engages in 
foreign borrowing (lending) in order to finance its foreign exchange 
operations. The foreign exchange authority is constrained to repay its 
debts. In the cooperative regime there is a foreign exchange authority 
which is supported by all countries. This means that whenever it 
needs to sell a currency of a particular type, the country whose 
currency is required will provide the currency to the foreign ex- 
change authority. There are important differences in the implications 
of these systems. At this stage I wish only to point out that, in the 
former, balance-of-payments adjustments do not change the quantity 
of money in circulation of the passive country but may change the 
quantity of money in circulation of the pegging country. This means 
that the value of money in circulation in the world economy may 
change as a result of balance-of-payments adjustments. In the latter 
system the aggregate value of money supply does not change as a 
result of balance-of-payments adjustments, which make it fall in line 
with recent modelings of the process of international adjustment 
under a fixed exchange rate (see, e.g., Mundell 1968, chap. 18). 

Following the description of the framework of analysis, I discuss in 
Section III some general properties of equilibria. Most notably, it is 
shown that in the presence of perfect foresight the equilibrium allo- 
cation of consumption is Pareto efficient in every exchange-rate re- 
gime. The assumption of perfect foresight is extremely important for 
this result. It requires that governments specify their policies in ad- 
vance and do not change them. For example, surprising monetary 
changes may destroy the efficiency of the resulting equilibrium. Al- 
though perfect foresight implies efficiency, the distribution of welfare 
depends on the exchange-rate system. 

Section IV is devoted to a discussion of a floating exchange-rate 
regime. It is shown that in the presence of perfect foresight, and 
independently of monetary policies, the equilibrium allocation of 
consumption coincides with an equilibrium of a costless barter econ- 
omy. Monetary policies which are specified in advance do not affect 
equilibrium consumption levels. It is clear from this result that in an 
equilibrium the liquidity constraints do not impose special constraints 
on the allocation of real resources. They do, however, help to de- 
termine nominal variables—prices, exchange rates, and interest 
rates—as is explained in that section. 

The one-sided peg regime is discussed in Section V. It is shown 
that, provided the foreign exchange authority breaks even, in this 
regime too the resulting perfect foresight equilibrium allocation of 
consumption coincides with an equilibrium of a costless barter econ- 
omy. Hence, a floating exchange-rate system and a one-sided peg 
have the same efficiency and distributional implications. There are, 


868 JOURNAL OF POLITICAL ECONOMY 


however, also important differences. In particular, in the one-sided 
peg regime there is only one exchange-rate level which enables the 
foreign exchange authority to break even in the face of a given policy 
of monetary transfers (taxes). Hence, there is need for coordination 
of the exchange-rate level with the required accompanying 
monetary-fiscal policy mix. Coordination of policies is not required in 
a floating exchange-rate regime. 

Section VI discusses the cooperative fixed exchange-rate regime. In 
this regime there is no need for policy coordination. But the perfect 
foresight equilibrium allocation of consumption and the distribution 
of welfare levels depend on the exchange-rate level and the 
monetary-fiscal policy mix that each country adopts. Only for very 
particular policy specifications will the cooperative peg equilibrium 
allocation of consumption coincide with the equilibrium allocation of 
consumption in a barter economy. 

There are several interesting directions in which the present study 
can be extended in order to shed more light on the relative desirabil- 
ity of alternative exchange-rate regimes. Some of them are discussed 
in Section VII. 


II. The Framework of Analysis 


The world that I consider consists of two countries, a home (A) 
country and a foreign (F) country. Every country produces a single 
output, and the two outputs are perfect substitutes in consumption. 
In every period a country faces a given unalterable level of output. 
Time is discretely measured by time periods. 

Neither the assumption of a single output nor the assumption of 
perfect substitutability in consumption is restrictive for the purpose of 
this paper; the main results do not depend on these assumptions. 
However, the assumption that output levels are unalterable does not 
enable me to discuss employment problems which are of major con- 
cern in discussions of exchange-rate systems (see, e.g., Friedman 
1953; Flanders and Helpman 1978). 

Every country has its own money. Purchases of the home country’s 
goods are paid for with the home country’s money; purchases of the 
foreign country’s goods are paid for with the foreign country’s 
money. Firms sell goods and transfer the profits to their stockholders. 
Home country firms are owned by home country residents while 
foreign country firms are owned by foreign country residents. As- 
suming that there are no costs of production, the value of a country’s 
output in terms of its own money is transferred by firms to its resi- 
dents.! 


* Since I will not deal with uncertainty elements and I will assume the existence of 
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The precise specification of money flows is important. Money is 
needed for transacting in the market, and it imposes liquidity con- 
straints on economic agents. One cannot buy goods unless one has the 
money to pay for them, and firms cannot distribute dividends unless 
they have the money to do so; all payments are done in the form of 
money. It is assumed, therefore, that firms distribute dividends at the 
end of each period. During a single period a firm accumulates money 
from sales of its goods, and it distributes this money to its owners at 
the end of the period. 

An individual arrives at the beginning of period ¢ with money 
balances that he left in his possession in period ¢ — 1 (these may be 
home country as well as foreign country money balances) and with 
money balances that he has received from firms via dividend pay- 
ments. At the beginning of period ¢ he has to repay his short-term 
debts (debts may be positive or negative) that were acquired in period 
t — 1, including interest payments; these may be debts denominated 
in the home or foreign country’s currency. 

The individual can freely exchange home currency for foreign 
currency and vice versa. In a fixed exchange-rate regime this is done 
with the Exchange Rate Stabilizing Authority (ERSA for short), while 
in a floating exchange-rate regime this is done in the foreign ex- 
change market. The individual can also borrow (lend) home or 
foreign country money at the beginning of period ¢. The completion 
of these transactions leaves him with certain quantities of home and 
foreign money as well as home and foreign currency—denominated 
debts (some of these may, of course, be zero). Now, part or all of the 
money balances can be spent on goods during period t—home cur- 
rency on home goods and foreign currency on foreign goods—while 
the remaining part can be kept in order to increase next period’s 
purchasing power. Thus, part of the total money holdings can be 
“money to spend” and part of it can be “money to hold” (see Hicks 
1967). 

In this paper I consider the case in which there are no restrictions 
on borrowing and lending behavior, except that debts have to be 
repaid. Foreign country decision variables are denoted by an asterisk, 
while home country decision variables are denoted without an as- 
terisk. The world economy faces a horizon of length T—the same for 
both countries.? 


bonds, there is no need to allow trade in ownership shares in firms, because such trade 
adds no new real trading opportunities. 

2 T assume a finite horizon. It is, however, easy to see from the following analysis that 
the results do not change when the horizon is infinite, provided equilibria exist. In 
order to analyze the infinite horizon case one has to replace constraints (1f) and (2f) 
with the requirement that the present value of debts is nonpositive for T going to 
infinity (see, e.g., Helpman and Razin 1981). 
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The home country’s representative consumer solves the following 
problem: 


Choose 
{Cut» Cres Mir, Mr Buv Bre}t=r (1) 
to maximize 
Un(Cy + Cri, CHa + Cro,.--, Cyr + Crr), 
subject to 
Cut» Crt; My, Mr, = O,7 ed ae eee ee (la) 
Min + Mr, S My + By, + €:Brp, + Xm, (1b) 
Puta < Mu, hima 82's waged, (1c) 
Pricer = Mri, freely 2 ere le (1d) 


Mi: + eMr S (Mins — Put—1CHt-1) + e(Mr1 — Prt-1Crt—1) 
+ Prayer — A tae) Bae — tee) C12) 
6: Brit Baek 6B Age, bS20- S58 «sts Lh 
Bir, Br = 0, (1f) 


where 


Uy = home country utility function that is strictly increasing in all 
arguments, 
Cut = consumption of home output in period f, 
Cr_ = Consumption of foreign output in period ¢, 
M} = beginning-of-period holdings of home money in period t, 
My; = beginning-of-period holdings of foreign money in period t, 
By, = home currency borrowing (lending when negative) at the be- 
ginning of period ¢, 
By; = foreign currency borrowing (lending when negative) at the 
beginning of period ¢, 
My = the initial stock of home country money, 
Xi = government transfers (taxes if negative) to home country resi- 
dents in terms of home money at the beginning of period ¢, 
é, = the exchange rate in period ¢, defined as the price of foreign 
currency in terms of the home country’s currency, 
the price of home output in terms of home currency in period 
t 
pr: = the price of foreign output in terms of foreign currency in 
period ¢, 
Yat = home output in period ¢, 


Pu 
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ty¢= the nominal one-period interest rate on period t home 
currency—denominated debt, and 

tee = the nominal one-period interest rate on period ¢ foreign 
currency—denominated debt. 


At the beginning of the first period, home country residents have 
My units of home currency, and they receive a transfer of Xj, units of 
home currency from their government. They borrow By, units of 
home money and B,, units of foreign money. Then, they reallocate 
total money holdings between home and foreign currency via the 
foreign exchange market in a floating exchange-rate regime or the 
ERSA in a fixed exchange-rate regime. This way they end up holding 
M,, units of home money and Mp, units of foreign money. No further 
financial transactions are needed during the first period. Consumers 
spend part or all of their money holdings on goods during the period. 
Thus, total beginning-of-period home money holdings are composed 
of PyiCy; Units of money to spend and My, — pyycy, units of money to 
hold. This is true for foreign money holdings as well. 

At the beginning of the second period, home consumers have Ma 
— Pim units of home currency and M,, — prim: units of foreign 
currency. They receive dividend payments which consist of Pum 
units of home currency and government transfers which consist of 
Xy2 units of home currency. They repay their first-period debts, 
including interest payments—(1 + 2,,)B,, in home currency and (1 + 
tp1)Bp, in foreign currency—and acquire new debts—B,. in home 
currency and Bry in foreign currency. They can make instantaneous 
transactions in the foreign exchange market or with the ERSA in 
order to coordinate the momentary money flows. Finally, they reallo- 
cate total money holdings between home and foreign money—M yy 
and M;».. And the process continues. 

We may now turn to the decision problem of the foreign country’s 
representative consumer. His decision variables are denoted by an 
asterisk, the initial stock of foreign money is denoted by Mp, foreign 
output in period ¢ is denoted by y,, and government transfers (taxes if 
negative) to foreign residents in terms of foreign money at the begin- 
ning of period t are denoted by X;,. All other variables are as defined 
above (except for the asterisk). 


Choose 
{chit Chr, Mii, Mi, Bi, BE hia (2) 


to maximize 


Up (cH, + c#,, cH + Ches..., chp + CBr), 
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subject to 
Cit, ch, Mijn Mi, = 0, L= i Ze rons Le (2a) 
_Min_ +4 M#, <M; +h + Ba, ee (2b) 
e4 1 
PutCiit S Mit, LS l, 2 eel c (2c) 
pri: < Mit, i — tle oan ee te (2d) 


Mit + Mi, < (Miit-1 — Pues) + (M¥_. — Dee Cree) 
t 


et 





+ preayre—1 — (1 + tae-1) ats — (1+ in )Bi (2e) 


+ Bh + Bg, +Xn,t=2,8,.-.,T, 
t 


Bir, BE, < 0. (2f) 


This is the foreign counterpart of the home country’s decision prob- 
lem (1). 


III. A Fundamental Property of Equilibria 


The purpose of this section is to show that all perfect foresight 
equilibria, independently of the exchange-rate regime, are Pareto 
optimal. This is an extremely strong result because it implies that in 
the present framework the exchange-rate regime is irrelevant from 
the efficiency viewpoint; that is, there is no exchange-rate regime that 
is more efficient than other exchange-rate regimes. This does not 
mean, of course, that exchange-rate regimes are irrelevant from the 
viewpoint of other economic considerations. For example, it may 
happen that different exchange-rate regimes have different distribu- 
tional implications in the sense that in some of them a particular 
country is better off than in others. I will address this issue in the 
following sections when discussing particular exchange-rate regimes. 
At this stage I should like only to point out that the equivalence result 
depends strongly on the ability of governments to pursue nondis- 
torting absorption policies whenever they are needed in order to 
accompany an exchange-rate regime. I will have more to say about 
this in Section V. 

First, independently of the exchange-rate system, in an equilibrium 
the interest rate and purchasing power parity conditions have to be 
satisfied. In order to have a solution to (1) and (2) with finite values of 
Buy Bry BE, BR, t = 1, 2,...,1 — 1 (ast-period debt is always zero 
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_due to [lf] and [2f]), there has to be interest rate parity in every 
period: 





eae eet ee a2 er. Tel. (3) 
“1b 
For if (3) does not hold, an investor can arbitrage between foreign and 
domestic debt so as to make infinite profits. One can use problems (1) 
and (2) to show that when (1 + tut) < (1 + tpy)(€,4,/e,) the solutions to 
these problems yield By, Bij, > +0 and By,, BE, > —x, and when 
(1 + ty) > (1 + trp)(e:4,/e,) the solutions to these problems yield Bj, 
De ier NUE Be oo These asset demands are, of 
course, inconsistent with an equilibrium. 

Second, in order to have equilibrium in commodity markets there 
has to be purchasing power parity in every period: 


Put ie erp rt, = e Zs elas coi Le (4) 


For if Pur > e¢pm, the solutions to (1) and (2)—taking into account 
(3)—yield zero demand for the home country’s output in period ¢, 
which is inconsistent with the clearing of commodity markets. Simi- 
larly, if Pir < epr, demand for foreign output drops to zero. 

In order to prove the main result of this section, it is useful to 
rewrite the consumer problems so as to replace the sequence of 
budget constraints by a single wealth constraint. This can be done in 
the present context despite the existence of liquidity constraints, be- 
cause (a) there exist bonds, (b) financial assets (including monies) are 
freely traded at the beginning of each period, and (c) the interest 
parity and purchasing power parity conditions are satisfied. 

First, observe that in a solution to problem (1) constraints (1b) and 
(le) will never be satisfied with strict inequality. For if there is a strict 
inequality in one of these constraints, say, in period ¢, one can increase 
domestic or foreign money holdings at no extra cost and use it as 
money to spend, thus increasing consumption in period ¢ without 
altering all other choice variables. This will increase the attained 
utility level. We can rewrite, therefore, constraints (1b) and (le) in 
equality form without altering the nature of the problem. Second, 
denoting by my, and m,, the components of home and foreign monies 
to hold in period ¢, 


Mae = Mri — putas t= ip De ee Le (5a) 
mp, = My — prers, Dee Ogee ied (5b) 


we can replace (1c) and (1d) with nonnegativity constraints on mj; and 


Mpt- 
Combining (1b) and (le)—written in equality form—with (5), and 
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applying the parity conditions (3) and (4), we get: 
purer + My + eympy = My + Byy + CBr, + Xm, (6) 


(1 + typi) "paece + (1 + tees) "(mae + epmpe) — (1 + tre) (Mea 
+ egmpe-1) = (1 + tye)" (Pae-1)t—-1 + Xt) — Barr (7) 
— €-1Bre-1 + (1 + toe-1) "(Bart + Bre), Cass ee 


where cy = Cyr + Cre = aggregate consumption in period t. 
Now define the discount factors: 


t 
Oh = []a +t Upeas oy bSi2nd; me tl and dpyael (8) 
T=2 


Then, by adding (7) for t = 2 to (6), adding the result to (7) for t = 3 
multiplied by dy, then adding the result to (7) for t = 4 multiplied by 
dy3, and so on, we get (using the interest parity condition [3]) the 
single wealth constraint: 


Tr T-1 
SF dtp ule + ‘i TresiCuemut + Cr+ iteeMre) 
t=1 


t=1 


+ dy(myr + epmpr — Bur — 7B rr) (9) 


T-1 T 
aa Dy: dutsiPudut + SY dinX nt- 
t=1 t=1 


On the left-hand side of (9) there are three terms. The first represents 
the present value of consumption. The second represents the present 
value of interest costs associated with holding money as a store of 
value, that is, associated with money to hold. The last term represents 
the present value of last-period money to hold minus debt. On the 
right-hand side of (9) there are three terms too—initial money hold- 
ings, the present value of dividends, and the present value of gov- 
ernment transfers. The sum of the right-hand-side terms represents 
initial wealth. Dividends are distributed at the end of each period, 
which explains why the firms’ proceeds in period t are discounted by 
dit+, and not by dj. 

Now problem (1) can be reformulated as follows. Choose ¢,, mi, Mr; 
eta oe ANG Da po 7, ON SORA O maximize Uy(C1, Ce, ..., 
Cr) subject to (9). Having found a solution to this problem, one can 
find a solution to problem (1) by choosing an arbitrary division of 
consumption in each period between home and foreign goods, then 
using (5) to solve beginning-of-period money holdings, and finally 
using (6) and (7) to solve home and foreign borrowing. 
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Applying to problem (2) the above-outlined procedure, one can 
derive the foreign counterpart of (9). Due to the interest and pur- 
chasing power parity conditions, the foreign wealth constraint can be 
written using foreign currency prices and discount factors or home 
currency prices and discount factors. The foreign country’s decision 
problem can then also be reformulated using a single wealth con- 
straint. 

The new formulations of problems (1) and (2) reveal an important 
characteristic of equilibrium allocations. If the interest rate on cur- 
rency j-denominated debt is positive in period t (j = H or F), neither 
the home nor the foreign country will use currency j as a store of 
value in period t (i.e., mj and m* will be chosen to equal zero). If, 
however, the interest rate on currency j—denominated debt is zero in 
period ¢, both countries will be indifferent toward using currency j as 
a store of value in period ¢. This means that, in an equilibrium, 
implicit interest costs associated with money to hold are zero in every 
period. This can be seen by observing that in the reformulated 
problems the “price” of home money to hold is dy4i4, and the / prices 
of foreign money to hold is dyy4.€;4;47,, while the marginal utility of 
money to hold is zero. The new formulation reveals also that in an 
equilibrium consumers make sure that the present values of last- 
period monies to hold and debts are all zero. This means that in an 
equilibrium the effective constraints on the choice of consumption 
plans are: 


r P1 r 
>, dutputr = My + Pe dutsiPutyut + yy dinX ut, (10) 
t=1 


t=1 t=1 


T TA T 
- driprec’s = My, + Sy Anes Priyrt a ey Ape X pt. (11) 
t=1 t=1 t=1 


Thus, equilibrium consumption plans are solutions to: choose c, = 0, 
Fa ees QAO INASOIZE M04, Coy =o 2.07) SULIECE10.(10); and 
choose c% = 0,f= 1,2,...,7, to maximize up(c%, cy, ... .,.c#) subject to 
(11). Due to the interest and purchasing power parity conditions 
(driprt)/(dpipr1) = (duePut)/(diipm), for t = 1, 2,..., T, implying that 
consumers face the same relative prices of consumption in both 


* This implication is a direct consequence of the assumption that one can switch 
costlessly from money to bonds and vice versa. With positive transaction costs associated 
with purchases and sales of bonds, money is used as a store of value even when interest 
rates are positive (see Baumol 1952 and Tobin 1956). This suggests that additional 
insight might be gained by extending the present model to allow for transaction costs in 
financial markets. 
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problems. Hence, every equilibrium allocation of consumption in 
which consumption demands are solutions to these problems is Pareto 
efficient. This proves the main result of this section. 

It is important to observe at this stage that the implications of being 
in an equilibrium which have been discussed so far are independent 
of the exchange-rate system; they apply to systems of fixed and 
floating as well as managed exchange rates. Every exchange-rate 
system has associated with it specific financial market-clearing condi- 
tions (the clearing conditions for commodity markets, Cy, + cH = Yue 
and cr, + ch = yr, are the same for every exchange-rate regime), 
which may have specific distributional implications. These implica- 
tions are considered in the following sections. 


IV. Floating Exchange-Rate Regime 


By a floating exchange-rate regime I refer to a regime in which no 
government agency intervenes in the foreign exchange market for 
the sake of intervention. Thus, if one was to reformulate the present 
model to include government purchases of goods and the govern- 
ment was instructed to acquire foreign exchange in order to purchase 
foreign goods, this would introduce intervention in the foreign ex- 
change market but not for the sake of intervention. However, if a 
government agency purchases foreign exchange, keeps it for several 
periods, and then resells it, we refer to this action as intervention for 
the sake of intervention. In the presence of this type of policy the 
exchange-rate regime is a managed float. 

Now consider the economies described in the previous section in a 
floating exchange-rate regime. The governments are assumed to en- 
gage in no economic activity except for the monetary transfers Xj, 
and X»; (which may be zero). This means that each currency has only 
one source of money creation—government transfers—that all debt 
is private debt, and that all commodity demands are private demands. 
Under these circumstances a floating exchange-rate regime equilib- 
rium can be defined as follows: 

DEFINITION: Prices {pPyi, Dre exchange rates {é,}4-,, interest 
rates {iyt, tres iat and the allocation {¢y;, Cr, CH, Che, Mir, Mr, M¥,, M#,, 
Bin, Br, Bi, BE} Y=, Constitute a floating exchange-rate regime equi- 
librium relative to [wy, up, My, Mr, {Xm, X res Yue» Vrodteal, if: 


a) for pu or Din» Pre = C; Ct — le ee T, and a4; = tut, Ure = Up, 
bie? Be ices, sk Ne ol = 
i) {Cu, Cre, Min, Mri, Bun, Bee, solves (1), 


il) {Cit Ch, Mi, Mi,, BH, BY, ee solves (2), 
b) Cue + Che = Jur t = 1,2,...,T 


o 
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C) Cre + CH = yr, t = LL 


: | 
d) My + Mi, =My+ > Xy,t=1,2;...,T, 


T=1 


fe 

e) Mr, + M% =Mp + > Xpnt = Le? eee Lis 
T=1 

b) Bag Be — 0,t= 1,2,3..25 1; 

Zipp tbr = 0,612) os, Ls 


In an equilibrium, individual optimization based on self-fulfilling 
expectation (condition a) is consistent with clearance of all markets in 
every period; market for home output (condition 6), market for 
foreign output (condition c), market for home money (condition d), 
market for foreign money (condition e), market for home currency— 
denominated debt (condition f), and market for foreign currency— 
denominated debt (condition g). 

The right-hand side of condition d represents the supply of home 
money in period ¢; it consists of the initial stock of money plus 
monetary injections up to (and including) period ¢. Similarly, the 
right-hand side of condition e represents the supply of foreign money 
to the private sector in period ¢. 

The main point that I wish to make in this section is that— 
independent of the transfer policies (Xy, and X;,)—in a floating 
exchange-rate regime the equilibrium allocation of consumption con- 
forms to a barter equilibrium allocation of the economies under 
consideration, and vice versa. That is, if we allow the economies to 
exchange goods over time, making all trading decisions in the begin- 
ning of the first period (as in an Arrow-Debreu model), the resulting 
equilibrium allocation of consumption will be an equilibrium alloca- 
tion of consumption in a floating exchange-rate regime, and vice 
versa. This is independent of the transfer policies (or, as some would 
call it, of monetary policies). 

Despite the fact that the equilibrium allocation of consumpton is 
determined only by real factors (the factors which determine an 
exchange equilibrium), every real equilibrium has associated with it 
equilibrium values of monetary variables like commodity prices, ex- 
change rates, and interest rates (cf. Kareken and Wallace, in press). 
The monetary variables are determined within the system. 

In order to prove the main result of this section recall the formula- 
tion of the optimal consumption plan problems that was developed at 
the end of Section III. 


Choose 
é;, = VU, pee ies Som Sel; (12) 
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to maximize 


Un(C1, C2, <2 ay Cr) 
subject to 
r [1 T 
» dit Puce = Mu + SS dut+iPutYue + » dX ut- 
t=1 t=1 t=1 
Choose 
ce = 0, bo i be tao ele (13) 
to maximize 
i hs Cpa, CF), 


subject to 


a Tai Tr 
S driprict = Mp + »s AressPreyre + » dneX re, 
=1 t=1 t=1 


Equilibrium consumption schedules are solutions to these problems 
independent of the exchange-rate regime. 

It is shown in the Appendix that, in a floating exchange-rate regime 
equilibrium, the right-hand side of the wealth constraint in (12) is 
equal to 27_.dipiyu while the right-hand side of the wealth con- 
straint in (13) is equal to 24_,dppr:yry. This means that in each country 
the present value of output equals the value of initial money holdings 
plus the present value of dividends plus the present value of govern- 
ment transfers. Hence, a country’s assets reflect precisely the value of 
its real output. Now, since (dpiprr)/(dripri) = (datPut)/(dinpui1) (due to 
interest rate and purchasing power parity), an equilibrium allocation 
of consumption in a floating exchange-rate regime—which is a solu- 
tion of (12) and (13)—coincides with an equilibrium allocation of 
consumption in a costless barter economy.* Hence, despite the fact 
that the monies play a real role in facilitating transactions, thus intro- 
ducing potential divergences from barter equilibria, in a floating 
exchange-rate regime with perfect foresight no such divergences 
exist. 

The fact that the equilibrium allocation of consumption in a float- 
ing exchange-rate regime coincides with the equilibrium allocation of 
consumption in a costless barter regime enables a rather simple cal- 
culation of equilibrium prices, exchange rates, and nominal interest 
rates. First, compute the equilibrium relative intertemporal prices in 
the barter regime. Let them be q, = 1, qo, qs, -- - , gr- Then we know 


4 This result is also derived in Lucas (1980b). 
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that in the floating exchange-rate regime prices and interest rates 
satisfy: 


eee eee On M3 | (14) 


dit Put-1 drt—-pre-1 qt-1 
Now, in the last period all money is spent on goods. Therefore, 
last-period prices are 


ata + Sn 


T=1 


e+ Sx 


T=1 


» Prr = 


Dur Ypr 


Put = 


Having last-period prices, we can determine prices and interest rates 
in period T — 1. From (14), using dye/dyg¢_; = (1 + t¢-1)~', k = H, F, we 
have 


(1+ f)Paens = PELE, (1 + inp s)Prnr = PEE. (15) 
£ 


First consider pyr_, and igr_;. Suppose ty7-,; > 0. Then all H money is 
spent on H goods in period T — 1 and 


[Me + SX] | 


Pur-1 = 
YHT-1 


Substituting this into (15), we calculate the interest rate 747_, which 
satisfies (15). If the calculated interest rate is nonnegative, we have the 
equilibrium price level and interest rate. If the calculated interest rate 
is negative, then the equilibrium interest rate is zero. Substituting ty7_, 
= 0 into (15), we can calculate the equilibrium price pyr_,. A similar 
procedure is applied to calculate tpp_, and Pp7_, . [tis now clear that we 
can use the same procedure to calculate pyr_2, ty7-2, Prr—2, and tyr-2- 
Thus, starting from last-period prices, all prices and interest rates can 
be recursively calculated. 

Observe that prices are determined according to the quantity 
theory of money. When a country’s money is not used as a store of 
value, all of it “chases” that country’s goods, and the country’s price 
level is determined according to the quantity equation with unitary 
velocity of circulation. When part of the money is used as a store of 
value, which may happen in this model only if the nominal interest 
rate is zero in this country, only part of the stock of money in 
circulation chases goods, and this part determines the price level. In 
fact, if the algorithm described above produces a zero interest rate 
in a certain period, the calculated price level of this period can be used 
in conjunction with output to calculate the quantity of money re- 
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quired for transactions. The residual quantity of money will be used 
as money to hold.® 


V. Fixed Exchange-Rate Regime: One-sided Peg 


In this section I consider a fixed exchange-rate regime in which the 
home country pegs its exchange rate to the foreign country. For this 
purpose the home country establishes an exchange-rate stabilizing 
authority (ERSA) whose task is to buy and sell foreign currency at the 
fixed rate of exchange e, which is assumed to remain the same in every 
period. The ERSA replaces the foreign exchange market. Since the 
home country cannot issue foreign money, its one-sided decision to 
peg the exchange rate makes it necessary for the ERSA to borrow 
abroad whenever it is asked to sell foreign currency, and I assume that 
whenever it is asked to buy foreign currency it lends it out. This 
assumption avoids unnecessary interest costs to the ERSA (these 
interest costs are sometimes considered as part of the costs of 
exchange-rate stabilization). Also, it is assumed that, whenever the 
ERSA buys foreign currency, it pays for it with new issues of home 
currency, thus increasing the quantity of home money in circulation. 
Whenever the ERSA sells foreign currency, it leaves in its possession 
the home currency that it receives against these sales, thereby reduc- 
ing the quantity of home money in circulation. This mechanism of 
exchange-rate stabilization approximates practices that were used in 
many countries (central banks usually perform the tasks of the 
ERSA). 

In what follows I wish to make two main points. One concerns the 
distributional implications of the exchange-rate system that is being 
considered (we know from Sec. II that the equilibrium allocation of 
consumption is efficient). It will be shown that when the ERSA is 
constrained to end up with no foreign debt, the resulting equilibrium 
allocation of consumption is the same as in a costless barter economy 
and, therefore, also the same as in a floating exchange-rate regime. 
Second, in order to meet the ERSA’s intertemporal budget constraint 
(the constraint on last period’s foreign debt), the transfer policy can- 
not be arbitrary. This means that in the present system the transfer 
policy cannot be divorced from the stabilization of the exchange rate, 
which may be considered as a disadvantage of this fixed exchange- 
rate regime.® Apart from these points we will have a closer look at the 


* An explicit calculation of an equilibrium for two toy economies is presented in 
Helpman (1979). Helpman and Razin (1981) present a similar calculation for the 
infinite horizon case. 

* A similar argument, but in a different setup, appears in Helpman and Razin (1979). 


EXCHANGE-RATE REGIMES 881 


behavior of prices and money stocks and the relationship between the 
exchange rate and the required transfer or absorption policy. 

I start by presenting the ERSA’s budget constraint. In order to save 
space let us use the convention that variables with a subscript ¢ — 1 are 
all zero for t = 1. The ERSA’s foreign debt in period ¢ is denoted by 
B#, and its addition to H money in circulation in period t is denoted by 
Xf, (Xf, may, of course, be positive or negative). The exchange rate is 
constant; that is, e, = e. 

There are two economic units which actively transact with the 
ERSA: the private sector of the home country and the private sector 
of the foreign country. The private sector of the home country has in 
period ¢ an excess demand for foreign currency which equals My; + (1 
+ ipp-)Bry-1 — Mp1 — By;; it equals total desired holdings of foreign 
currency, plus foreign debt repayments, minus the quantity of 
foreign currency transferred from the previous period (the previous 
period’s foreign money to hold), minus foreign borrowing. This ex- 
cess demand is satisfied by purchases of foreign currency from the 
ERSA, for which the private sector pays with home country money. 
The foreign private sector has an excess demand for home currency 
which equals Mj, + (1 + ty:—-1)Biie-1 — Min-1 — Bit This excess demand 
is also satisfied by transacting with the ERSA. For the purchase of 
home currency the foreign private sector pays [Mj + (1 + ta-1)Bie-1 
— m%,_, — By,\/e units of foreign currency. Hence, the deficit in the 
balance of payments which equals the net excess demand for foreign 
currency with which the ERSA is faced in period ¢ is: 


Dr = LM ry ee (1 2 tre) Bret — Mape1 Brel 


at [M#, + Ud + tnt—-1) Di-1 — Min—1 — Bi I 
e 


(16) 


Against this excess demand for foreign currency there is an equal 

value excess supply of home currency which the ERSA has to absorb. 
Hence, Xe, — —eDrt, and 

XE = —elLMae + (1 + tes) Brt-1 — Mrt-1 — Brill 

iMac Li tut) Bit-1 — Mitt-1 — Bil. 

Apart from the net excess demand for foreign currency by the private 

sector in period ¢, the ERSA needs foreign currency in order to repay 

the debt that it acquired in period ¢ — 1. Hence, the ERSA’s foreign 


borrowing in period t equals the private sector’s net excess demand 
for foreign currency plus its repayments of foreign debt: 


(17) 


Be, = [Mr + (1 + tet—1) Bre. — Mrt-1 — Bre] 8) 


= (Mii +L tytn — Mls — Fe (1 + try) Bre-1- 
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The last equation is based on the assumption that the ERSA does not 
carry forward any foreign currency; when it has foreign currency it 
either sells it or lends it out. This assumption assures efficient man- 
agement of the exchange-rate stabilizing operations.’ From (16) and 
(18), we have: 


Dy, = BR, — BRi-1 i try-1 BF 1. (19) 


The deficit in the balance of payments equals the increase in the 
ERSA’s foreign debt minus its interest payments. This is the definition 
of the deficit in the balance of payments which associates the deficit 
with the ERSA’s net sales of foreign currency to the private sectors, 
which also reflects the contraction of home currency in circulation 
due to the balance of payments. However, in this framework it might 
be more appropriate to define the deficit as the increase in the ERSA’s 
foreign debt. The difference between the former and latter defini- 
tions amounts to the exclusion or inclusion, respectively, of interest 
payments on the ERSA’s foreign debt in the definition of the deficit in 
the balance of payments. Clearly, if the latter definition is used, then 
the contraction in home money in circulation as a result of exchange- 
rate stabilization is not equal to the deficit in the balance of payments. 
Now we are prepared to define a one-sided peg fixed exchange-rate 
regime equilibrium. A formal definition is provided in Helpman 
(1979). Here I discuss informally the difference between the one- 
sided peg equilibrium and the floating exchange-rate equilibrium. 
In the one-sided peg the ERSA stabilizes the exchange rate at a 
fixed level ¢ in all periods. Hence e, = e fort = 1,2,... , I. Asa result, 
it engages in foreign borrowing (B#,) and monetary injections (X§,). 
Since the ERSA has to satisfy its temporal budget constraints and 
break even over time, all choice variables—private as well as the 
ERSA’s—have to satisfy (17) and (18). In addition, the exchange-rate 
stabilizing authority has to end up with no foreign debts: 
Bry = 0. (20) 
Of course, commodity markets have to clear, so that conditions 6 and c 
from the definition of a floating exchange-rate equilibrium apply also 
to the present regime. Finally, money markets and bond markets have 
to clear. Since our ERSA does not operate in the domestic bond 
market, it does not carry forward foreign money, and it does not 
make foreign currency transfers, conditions e and J from the defini- 
tion of a floating exchange-rate equilibrium apply also to the present 
regime. A difference arises in the equilibrium conditions for the 
home currency market and the foreign bond market, because the 


7 See, however, Helpman and Razin (1980) for a case in which, due to uncertainty, 
the ERSA has to carry forward some foreign currency. 
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ERSA borrows on the foreign bond market and it changes the supply 
of domestic money by means of the exchange-rate stabilization oper- 
ations. The market-clearing conditions for the domestic money mar- 
ket and the foreign bond market read in the present regime: 


t 

Min + Mi = My + > (Xue +X8), ¢=1,2,...,T, (21) 
T=1 

Bro BE? BE, = 0, (= 1,2,4..,T. (22) 


First let me prove that the one-sided peg fixed exchange-rate re- 
gime equilibrium allocation of consumption is the same as that in a 
barter economy. For this purpose recall the formulation of the opti- 
mal consumption plans that was developed at the end of Section II, 
and let us use formulations (12) and (13) for present purposes. It is 
sufficient to show that in the home country’s problem (12), the right- 
hand side of the wealth constraint equals the present value of output 
in equilibrium. 

In a fixed exchange-rate regime the interest rate parity condition 
(3) reads ing = tp, = 4, t = 1, 2,..., 7, which implies equality of 
discount tactors: that is, d,, = d,;, = d,,t = 1, 2,..., 1: Hence, the 
equality between the right-hand side of the wealth constraint in (12) 
and the present value of output translates in the present context into: 


T-1 T z 
My + OF dis Purdue + Os aX ut = A diputyut 
t=1 1 t=1 


which is equivalent to 


Lt 


pat iB 
Ds (diss — dt) pudat — Irpurdur = ~My — oe AX nt. (23) 
t=1 t=1 

Now if 2, > 0 (i.e., d44, < d,), money is not used as a store of value and 
all home money is spent on home goods, implying: 


(div ~ 4) Pudue = (dees ~ d)| Mu +>) Xue + Bid]. (24) 


T=1 
If 2, = 0 (1.e., d,,, = d,), then (24) holds trivially. Hence, (24) holds for 
t= 1,2,...,7 — 1. In the last period all money is spent on goods; 
therefore paryyr = My + 27-,(Xy, + X¥,). Combining the last result 
with (24) and’ applying the method employed in the Appendix, one 
gets:® 


T1 
Sy (dt44 — di) Pre) ut a Appuryur ai 
t=1 


bf 
=My— > dudXne + Xi). 


t=1 


8 For details see Helpman (1979). 
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By comparing the last result with (23) it is clear that in order to prove 
(23) it remains to show that >f_,dy,X}, = 0. From (17) and (18), 
using the interest rate st er condition 14; = tp; = 4, we have: 


= e[(1 + 1,_,) BF, — BF,I. 


Hence, using the convention Bf, = 0 and the fact that d,(1 + %4_,) = 
d;-1, we calculate 


Ais 
Dd 4Xhi =e y dl(1 + i,,)BR,-. — BR 
t=1 


T T 
ad (Ss d,_,BRy_, — oe ‘sh = —ed,Bi, = 0, 


which completes the proof (the last equality results from [20)]). 

I have shown that the equilibrium allocation of consumption in the 
present exchange-rate regime is the same as in a costless barter econ- 
omy and, therefore, also the same as in an equilibrium of a floating 
exchange-rate regime. This means that the welfare level achieved by 
each country is the same in each (equilibrium) of these three alterna- 
tive modes of organization of the world economy. What I wish to 
argue now is that the country that chooses to stabilize the exchange 
rate in the one-sided peg fixed exchange-rate regime cannot pursue 
an independent monetary (transfer) policy. In particular, for given 
transfer policies in the two countries, there is exactly one level at which 
it can stabilize the exchange rate and end up with no foreign debt. Put 
differently, if the country chooses to stabilize the exchange rate at a 
particular level, the requirement that it ends up with no foreign debt 
imposes on it a particular transfer policy; the present value of its 
transfers is uniquely determined, although their distribution over 
time can be freely chosen. Changes in the pattern of transfers change 
the pattern of deficits in the balance of payments but not the equilib- 
rium pattern of consumption, provided the present value of transfers 
is maintained at the equilibrium level. 

In order to prove my arguments, observe that the purchasing 
power parity condition py = epp, and (23) imply: 


T=1 
My + ss aXm = ¢| deers a oy (d, — dis bead} (25) 
1 t=1 


Since in the last period foreign money is spent on foreign goods, 


if 
[Ms +> Xn 
Per = ———_1—_.. 
rr. 


(26) 
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16a 0 E417 258-2, fF — 1} then in period’ all foreign money 4s 
spent on foreign goods. If 7 = 0, then d; = d;4,. Therefore, 


[a + Se) 


(dt — des) Pre = (de — de+1) neat anak) fea eee ee Ie 
Fr 
(26a) 
In addition, 
d 
ae ey ee t=2,3,...,T.q4 = 1, (26b) 


dt-sprt—1 9t-1 ; 


where gq; is the equilibrium marginal rate of substitution between 
consumption in period ¢ and consumption in period 1. From (26) 
foreign prices and the discount factors are uniquely determined. 
They are, in fact, identical to the equilibrium foreign prices and 
foreign discount factors in a floating exchange-rate regime because 
the equilibrium marginal rates of substitution q, are the same in both 
regimes. Hence, the equilibrium value of the expression in the brack- 
ets on the right-hand side of (25)—which is positive—is independent 
of the home country’s transfer policy and exchange-rate level. There- 
fore (25) implies a one-to-one relationship between the exchange-rate 
level and the present value of home money transfers. 


VI. Fixed Exchange-Rate Regime: Cooperative Peg 


The last exchange-rate regime that I wish to consider is one in which 
both countries cooperate in the stabilization of the exchange rate. For 
this purpose the countries establish an ERSA which buys and sells 
currencies at a fixed rate of exchange, e units of home currency per 
unit of foreign currency. The cooperation consists of each country’s 
supplying the ERSA with its own currency whenever the ERSA is 
asked to sell it and is short on it. The ERSA does not engage in 
borrowing and lending. When foreign currency is sold to the ERSA, 
this transaction reduces the quantity of foreign currency in circula- 
tion. It also increases the quantity of home currency in circulation, 
with the increase equal in value to the reduced amount of foreign 
currency in circulation, since sales of foreign currency mean pur- 
chases of home currency. The opposite effect occurs when home 
currency is sold to the ERSA. This means that the operations of the 
ERSA do not change the aggregate value of monies in circulation; 
they only change their composition. However, transfer policies affect 
the aggregate quantity of money in circulation. The present regime is 
in line with recent modelings of fixed exchange-rate regimes in that 
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the stabilization of the exchange rate per se does not affect the 
aggregate supply of money in the world economy.® 

In the cooperative fixed exchange-rate regime the changes in 
money supply caused by the ERSA are (using the interest parity 
condition and the convention that variables with subscripts ¢ — 1 are 
zero for t = 1): 


Xhp = (My, + (1 + 4~-1)Bre — Mrt—1 — Bri] 


(27) 
_ (MA, + +4) Bi ee — min — Bird : 
e 
Xie = (M&, + (1 + Bi — mit — Bin (28) 
— e[My, + (1 + 4) Briar — Mrt—1 — Bril. 
Thus, 
XE, +eX#, = 0, Le ee Ae (29) 


The definition of an equilibrium should by now be obvious and I 
abstain from stating it. 

Using the equilibrium and parity conditions, one derives—by 
means of the method employed in the previous section—the follow- 
ing budget constraints on consumption vectors: 


Z r T 
ys dipute = > diputyut — Ss di Xint, (30) 

t=1 t=1 t=1 

r r r 
sy diputci = De dipuryrt — & >! d,XFt- (30a) 

f=1 = t=1 


Due to (29), (30) implies that the equilibrium allocation of consump- 
tion in the cooperative fixed exchange-rate regime coincides with the 
equilibrium allocation of consumption in a barter economy if and only 
if the present value of monetary injections of currency H (or contrac- 
tions of currency F) caused by the operations of the ERSA is zero; that 
is, &4_,4,X#, = 0. I have shown in the previous section that in the 
one-sided fixed exchange-rate regime 27_,d,X#, is necessarily zero in 
an equilibrium. Here this is not the case; in an equilibrium of the 
present regime the present value of home currency injections 


* If in a cooperative peg only one country sterilizes changes in its money supply that 
result from balance-of-payments adjustments, then the behavior of money supplies in 
this regime will resemble the behavior of money supplies in a one-sided peg regime. It 
is interesting to note that the prediction of the one-sided peg regime about the behavior 
of the world price level is consistent with the empirical findings in Genberg and 
Swoboda (1977) (i.e., that the money supply of the Passive country, the United States, 
explains the world’s dollar price level). Since they took the standard view of a fixed 
exchange rate (i.e., a cooperative peg), they explained their findings with an argument 
about sterilization on the part of the United States, an argument that they did not test. 
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through the balance of payments need not be zero." If it is positive, 
the home country’s present value of consumption is less than the 
present value of its output, and the foreign country’s present value of 
consumption exceeds its present value of output. If it is negative, the 
opposite holds. In the former case it is as if the home country is 
making a transfer to the foreign country, while in the latter case it is as 
if the foreign country is making a transfer to the home country. Since 
in either case the allocation is efficient, then—to the extent that the 
transferer loses in welfare terms (which is not always the case)—in the 
former case the home country is worse off and the foreign country 
better off than in a floating exchange-rate regime (or a one-sided 
peg), while in the latter case the opposite holds. Since the outcome 
depends on the level of the exchange rate and on each country’s 
transfer policy, countries have conflicting interests in the determina- 
tion of the exchange-rate level and monetary policies. 


VII. Concluding Remarks 


I have presented in this study an evaluation of exchange-rate regimes 
for a world economy in which there exists a single friction; that is, 
there exists a payments period and monies are required for market 
transactions. It has been shown that this friction has certain distribu- 
tional implications in each exchange-rate regime, but it does not 
introduce inefficiencies into the economic system. All results are 
easily extended to a many-country framework. In a way, these results 
provide a benchmark for the evaluation of more realistic models and 
their implications about exchange-rate regimes. 

There are various rigidities and imperfections (e.g., wage contracts, 
market imperfection) whose existence may have a bearing on the 
relative desirability of alternative exchange-rate regimes. Thus, it is 
possible that a particular exchange-rate regime performs better than 
others under certain types of imperfections. A characterization of 
imperfections under which each exchange-rate regime performs best 
will be valuable. Some work on the effects of uncertainty and imper- 
fect capital markets is reported in Helpman and Razin (1980) and 
Lucas (1980b). Lucas provides conditions under which a floating 
exchange-rate regime is equivalent (from the allocational viewpoint) 
to a fixed exchange-rate regime, despite the existence of uncertainty 
and incomplete capital markets, while Helpman and Razin provide 
conditions under which a floating exchange-rate regime performs 
better than a one-sided peg. 

There is also another aspect of exchange-rate regimes which has 


1 An example showing this point is presented in Helpman (1979). 


888 JOURNAL OF POLITICAL ECONOMY 


received no attention in a welfare context but which is of major 
importance, and this is the strategic behavior of governments. I have 
shown that the equilibrium allocation of consumption is efficient in 
every exchange-rate regime and that, in a floating exchange-rate 
regime as well as in a one-sided peg regime, it corresponds to the 
equilibrium allocation of consumption in a barter economy. A crucial 
assumption on which these results are based is that expectations are 
self-fulfilling; that is, there are no surprises. In particular, the transfer 
policies are announced at the very beginning, and each government 
honors its declared policy. What happens if in the course of time one 
of the governments deviates from its announced policy? This type of 
surprise destroys both results. This can be seen as follows. Suppose 
that at some point in time the foreign country is a net creditor in 
terms of foreign currency and it has zero net debt in terms of home 
currency. Then, by reducing the quantity of foreign money it in- 
creases via exchange-rate changes the real value of payments that it 
will receive for its outstanding credit." Thus, governments have an 
incentive to generate monetary surprises. But if they do it, one cannot 
analyze the consumer problem the way it was done in this paper, 
because the consumer will learn about the possible surprises and will 
accordingly adjust his decisions. Other governments will also adjust 
their policy. Hence, a game-theoretic framework, with an appropriate 
reformulation of individual problems, is required in order to analyze 
these issues properly. 

The incentive for strategic behavior is even more apparent in the 
cooperative-peg regime. Recall that in this regime the exchange-rate 
level and the preannounced monetary policies determine real trans- 
fers among countries. There are, therefore, conflicting interests in the 
choice of an exchange-rate level and monetary (transfer) policies. In 
particular, once an exchange-rate level has been chosen, then in the 
course of time countries may try to secure for themselves higher 
welfare levels by departing from the preannounced monetary 
policies. Such unexpected departures generate allocational inefficien- 
cies. Again, in order to analyze this problem properly, one needs an 
appropriate game-theoretic framework. 

The need for an extension of the present analysis in order to take 
account of strategic behavior is clear from the context of this paper. 
However, such an analysis may also help to explain changes in 
exchange-rate regimes that took place in the world economy, such as 
the collapse of the Bretton Woods system. 


A numerical example showing this effect is presented in Helpman (1979). Similar 
effects exist in the one-sided peg regime. 
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Here I prove that in a floating exchange-rate regime equilibrium the right- 
hand side of the wealth constraint in (12) equals 24.diaPurYae- A similar 
method can be used to show that the right-hand side of the wealth constraint 
in (13) equals 24_,dpiPriye:- The proof proceeds as follows: 


T-1 T T Tr 
My t S does Puc a > daXne = > dnb hi (Mu ah > dn, 
t=1 t=1 t=1 t=1 
T-1 
(Al) 
te S Guess — dy)Paue — Cur uryur- 


t=1 


However, I have shown in Section III that for 274, > 0 (or diz4, < dy) no one 
uses domestic money as a store of value, and that in the last period all money 
is spent. These results, together with the equilibrium conditions 6 and d in the 
definition of a floating exchange-rate regime equilibrium, imply 


t 
(dit+1 — dyt)P ut) ut = (ditwi — dind(Mr ats DXi) 4 ele tee al aay ls 


T=1 


T 
Puryur = My + > ke 

T=1 
This is so, since when domestic money is not used as a store of value in period 
t, all of it is spent on goods, implying py:Cu: = Mar and purlin = Mm. Using 
condition b this implies that paiva = Mar + Mi, which yields priya = Ma + 
>*_,X,,, when combined with d. Thus, the above relations hold for 2, > 0, but 
they hold also trivially for ty, = 0 since ty, = 0 if and only if diz4, = dz. Using 
these results, we obtain: 


rs 


T-1 t 
Gas, — din)Pudan — CarParar = ada = dnd Mu a DXi 
T=1 


t=1 t=1 


rE Tr 
7 diyr| Mu a > Xx] te ae Syhoee 
t=1 


T=1 


(A2) 


The last equality is obtained from the fact that 
71 T1 1 T1 
> duces om Wier. — dup = Deas . Died = -dy, = —1, 
t=1 t=1 t=2 (=1 
and ; 
T=1 t P—1 t T 
ie ae i Sdn > Xue a Bin are 
t=1 T=1 t=1 1 T=1 
T-2 t TA t 
=a eae ei Ya ks i dypX ur 
t=1 T=1 t—2 =1 
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T= T=1 Tad t T 
7 > due) Xi = Didi — Xi — dypX yr = — Dy daekae 
t=2 =1 t=2 il a 


Combining (A1) with (A2), we obtain 
TI r Tr 
My + » dieePudn a ce = S dubuome 
t=1 t=1 i=1 


This completes the proof for the home country. A similar proof applies to the 
foreign country. 
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Energy-production subsidies are, paradoxically, shown to be likely to 
increase U.S. dependence on imported oil. Standard studies of net 
energy yields are shown to be seriously biased upward for two rea- 
sons. First, many omit indirect energy inputs, which, our input- 
output calculation shows, probably causes large errors. Second, those 
studies all omit the energy opportunity costs of nonenergy inputs 
(e.g., the fuel substituted elsewhere for the labor used to produce 
energy). We prove that, absent externalities, any fuel-output subsidy 
which causes an otherwise unprofitable expansion must yield an 
incremental fuel output smaller than the increment in energy input 
plus the energy opportunity costs of other inputs. 


The policy measures under consideration to decrease our de- 
pendence on imported oil now seem to include two leading candi- 
dates: (i) taxes or tariffs to discourage consumption and increase the 
profitability of alternative energy sources and (ii) a program of sub- 
sidies to innovative energy sources. Politicians have generally em- 
braced the latter as the one means of salvation from an alternative 


We are sincerely grateful to the Sloan Foundation, whose grant greatly facilitated 
completion of this study. Sue Anne Batey Blackman provided invaluable research on 
net-energy study techniques in practice and has given us very helpful comments. We 
are also grateful to James Sweeney of Stanford University for valuable comments on an 
earlier draft which helped us to avoid some misleading statements of our results. 
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that appears to them to be politically suicidal, while economists have 
almost uniformly favored the former, arguing that the program of 
special subsidies is an invitation to inefficiency. 

In this paper we will come to a similar but considerably stronger 
conclusion. We will argue that the difference between the two ap- 
proaches is potentially more significant and serious than even most 
economists had imagined. Our main conclusion can best be stated in 
the form of an example. Both programs we are considering can be 
used to stimulate the production of oil from sources such as shale. 
Any expansion in this activity stimulated by a tariff on imported oil 
can be expected to have a positive net energy yield and, in particular, 
a positive net yield of oil. However, an expansion in the same activity 
induced by a program of subsidies is likely to have a negative net yield 
of petroleum (!) if other outputs in the economy remain unchanged. 
Of course, in practice the subsidy is likely to cause some reallocation 
of resources that reduces at least some other (energy-using) economic 
activities. The final result may well be an overall increase in the 
nation’s stock of energy. However, we will show, under very plausible 
assumptions, that the increase in oil stocks could always have been 
made greater still if exactly the same reduction in other energy-using 
activities had occurred by itself, with no expansion in oil shale pro- 
cessing. The implications for current policy are, clearly, more than a 
little disquieting. 

This result actually has a straightforward explanation, as we will see. 
Basically, the extreme difference between the efficiency of the general 
tariff and that of the targeted subsidy lies in the fact that the latter 
does not lead to the appropriate substitutions of energy and other 
inputs throughout the economy that should be part of a program to 
increase oil stocks. Consequently, the subsidy induces the employ- 
ment in oil shale processing of inputs which could have been substi- 
tuted elsewhere in the economy for larger quantities of oil than they 
can yield through oil shale conversion. 

Several comments are required before we turn to the body of our 
analysis. First, the basic premise on which the entire discussion pro- 
ceeds is that vulnerability of the nation to blackmail by foreign gov- 
ernments is a true social cost and can legitimately be treated as a 
detrimental externality of any increase in oil consumption. This and 
self-defense against economic exploitation by the cartels constitute the 
justification of a tariff on oil imports. In the absence of these two 
problems it could be argued that the rising cost of petroleum is an 
ordinary resource-depletion problem which can, perhaps, be handled 
reasonably well (distributive considerations aside) by the market, 
uninfluenced by any particular government intervention. 
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Second, it should be obvious that government programs other than 
the two we are considering are also possible. For example, subsidies to 
new energy sources can be accompanied by a set of taxes on use of oil 
which together will constitute a reasonably efficient program. This is 
rather analogous to the substitution of a combined program of taxes 
and subsidies for a system of effluent charges.1 However, because of 
the difficulties of designing and administering an ideal mixed pro- 
gram, together with the fact that it is (apparently) not being consid- 
ered by decision makers anywhere, we have chosen not to deal 
explicitly with such alternative measures. 

Finally, it must be emphasized that while we believe that a tariff or 
some close substitute is the best way to encourage investment in solar 
energy, biomass, and other innovative energy sources, and.that direct 
subsidies are likely actually to yield net energy losses in the sense 
indicated, we will see that there can be exceptions. Where substantial 
outlays on research and development must precede a substantial 
increase in such an energy-producing activity, and if they introduce 
significant free-rider problems, then for obvious reasons the market 
will not allocate a sufficiently large quantity of resources for the 
purpose. In such a case a subsidy may be entirely justified. But our 
analysis suggests that such exceptions should be permitted only on the 
basis of extremely strong evidence, in light of the danger that such a 
subsidy will actually exacerbate the nation’s energy problem. 

In this paper we will also show that the standard net-energy studies 
are characterized by a bias toward overestimation of the net yields of 
the processes they examine, only part of which has previously been 
recognized. This bias is not a matter of overenthusiasm of inves- 
tigators, which leads them inadvertently or deliberately to distort their 
results, but is the consequence of several technical shortcomings in 
their study procedures. One of their difficulties is that, while net- 
energy studies commonly take into account the direct fuel inputs used 
in the processes which they examine, many of them consider only part 
of the remainder of the multiplier-like series—the materials and 
machines needed to make those materials and machines, and so 
on—all of which, obviously, have their own energy requirements. We 
will provide empirical evidence which indicates that the omissions are 
sometimes far from negligible. Moreover, even when the total energy 
requirements of a new process are “correctly” calculated using input- 
output techniques, only the average energy requirements per unit of 


1 We are indebted to James Sweeney for drawing our attention to this sort of mixed 
policy. Note, however, that a mixed program is not likely to be a perfect replacement 
for a tariff on oil imports. Thus, as is well known, in environmental programs partial 
substitution of subsidies for effluent charges is likely to lead to inferior results. 
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energy output will thereby be determined. The marginal energy re- 
quirements are likely to be higher than the average requirements, and 
therefore the marginal net energy yield may be considerably lower 
than the calculated average energy yield. 

But this is not their only omission. The nonenergy resources that go 
into an energy-production process have an energy opportunity cost, 
as we have already noted, for they would instead have been used, at 
least in part, as substitutes for energy in other sectors of the economy. 
This energy opportunity cost is generally left out of net-energy 
studies, though it is in fact a very real concomitant of the production 
process. As a result of all of the biases that have just been mentioned, 
a process which appears to pass the net-energy test by a narrow 
margin is nevertheless likely to fail it in fact. 

While the indirect energy costs and the energy opportunity costs of 
a production process may prove very difficult to measure directly, we 
will show here that there is (at least at the margin) a duality relation 
between profitability and net energy yield which permits the former 
to stand as a proxy for the latter. We will prove that any energy- 
production process which is unprofitable can from this fact alone be 
considered a potential net energy loser in the sense indicated earlier. 
Consequently, we can no longer presume that energy-losing projects 
are apt to be rare and pathological exceptions. Rather, we are forced 
to regard any innovative energy project which can be carried out only 
with the aid of government subsidies as a proper subject of suspicion. 


I. The Energy Opportunity Cost of Nonenergy Inputs 


The first issue we raise is the failure of many net-energy studies to 
carry their calculation of the energy used in the production of energy 
a sufficient number of “rounds” into the past, to determine the energy 
content of the inputs used to make the inputs, and so on, used in the 
energy-production process. Many of the net-energy studies employ a 
procedure called “process analysis” which involves explicit engineer- 
ing descriptions of all the steps involved in a manufacturing process, 
and it is easy to see why it is impractical for this method to encompass 
many earlier rounds. Although this problem has previously been 
recognized in the literature, the magnitude of the resultant bias seems 
not to have been investigated. We therefore carried out some calcula- 
tions to evaluate the magnitude of the error, using 1972 input-output 


” See, e.g., Berry 1978-79. It is noteworthy that some of these studies are careful to 
warn the readers of this source of bias (see, e.g., Perry et al. 1977; Berry 1978-79; Fels 


and Williams 1978), while others (e.g., Baron 1976, 1978; Hunt 1976) do not mention 
the subject. 
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data provided by the Bureau of Economic Analysis* and 1967 energy 
coefficients provided by the Center for Advanced Computation 
(1975). Letting the matrix 4 represent the Leontief interindustry 
coefficient matrix and D the vector of direct primary energy con- 
sumed per unit of sectoral output, the total energy consumed per 
unit of sectoral output E is given, for an average sector, by E = Di — 
Ayr. 

In some process-analysis studies, only the direct energy cost per 
unit of output of a new energy-producing process is considered, only 
taking account of the vector D. In others, the energy cost includes the 
“first-round” energy content of the new process and is evaluated, in 
effect yielding the sum (D + DA). Reportedly, only the most ambitious 
process analyses include the “second-round” content, evaluating the 
energy cost (D + DA + DA’). Thus even these studies omit the portion 
(DA® + DA* + ...) of the cost of a new energy process, and so they 
overestimate its net energy yield correspondingly. 

We carried out calculations to estimate the magnitude of the errors 
that can result.® We first looked at the discrepancy between the direct 


8 For a description of the data, see Ritz (1979) and Ritz, Roberts, and Young (1979). 

* The direct energy requirements for each sector are valued in British thermal units 
and were divided into four sources in the original data: (1) coal, (2) petroleum prod- 
ucts, (3) natural gas, and (4) electricity. This breakdown created an immediate problem 
for the calculation of total energy requirements, for electricity is not a primary energy 
source but a manufactured form of energy. It must be produced from coal, petroleum, 
natural gas, uranium, running water, geothermal energy, solar energy, or some other 
natural source. Thus there is double counting in the original energy matrix which 
includes both the oil, coal, and natural gas used to produce electricity and the electricity 
itself. Therefore, we first assigned the primary-energy sources used to produce elec- 
tricity to the electricity-using sectors, since we still want to classify the electricity as a 
direct energy input. Thus we pretended, in effect, that each industry produces its own 
electric power (as some, indeed, do). In the 85-sector scheme, there are two sectors that 
sell electricity—utilities (sector 68) and state and local enterprises (sector 79), which 
includes public utilities. Our first step was, then, to distribute the coal, petroleum, and 
natural gas used in the two utility sectors to the nonutility sectors in proportion to the 
dollar purchases of the nonutility sectors from the utility sectors. The next step in the 
assignment of electricity generation among its users was to distribute the intermediate 
inputs of the two utility sectors to the nonutility sectors in proportion to the dollar 
purchases by the latter from the former. The last step was to enter zeros into the cells 
recording the purchases from the utility sectors by the nonutility sectors. 

° We carried out two more modifications in our estimation procedure. The first was 
to take into account the energy content of the fixed capital stock. (The A matrix consists 
only of intermediate flows.) Since a piece of capital is used to produce more than one 
unit of output, the appropriate addition is the total energy content of the capital stock 
of each sector per unit (or, in our case, per dollar) of output. To estimate this, we added 
a row of depreciation coefficients (depreciation per dollar of output) to the A matrix. 
Moreover, we evaluated the energy content of the depreciated portion of the capital 
stock per dollar of output not at its historical value but at its replacement value. To do 
this, we assumed that new capital stock by component was added in each sector in 
proportion to the sectoral components of the gross private capital formation vector in 
final demand. A new coefficient column was therefore added to the 4A matrix corre- 
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energy requirements and the total energy requirements in the man- 
ufacturing sectors in the input-output table. The (weighted) average 
ratio D,/E; was 0.36 (where the subscript 7 refers to sector 2), meaning 
that such a calculation, on the average, omits some 64 percent of the 
total energy cost. The individual ratios ranged from 0.09 in the 
apparel industry to 0.62 in chemicals. Next, we looked at the ratio of 
the direct plus first-round energy requirements (D + DA) to the total 
energy requirements, which averaged 0.66 and ranged from 0.30 in 
apparel to 0.83 in chemicals. Finally, the ratio of direct plus first- 
round plus second-round energy requirements (D + DA + DA?) to 
total energy requirements averaged 0.78 and began from as low a 
figure as 0.53. Even using the direct plus first-round plus second- 
round estimates of energy requirements, we find an average error of 
the order of 28 percent, which can severely bias standard estimates of 
net energy yield. This means that if a process-analysis calculation 
indicates that a given process will offer a positive net energy yield as 
high as, say, 20 percent, it is in fact likely to be a net energy loser. 

But the omission of some indirect energy uses by net-energy studies 
is not the only significant source of overoptimism in their estimates of 
the net yield of new energy sources, for the calculation still does not 
take into account the potential energy-saving powers of the labor, the 
metals, and the other primary resources (aside from energy) used 
directly and indirectly in the production of energy. We will provide a 
formal discussion of this point, whose implication will be that any 
unprofitable energy source is a proper candidate for suspicion that it 
will be a net energy loser, and in later sections still stronger proposi- 
tions will be offered. But first a simple example may help to indicate 
the nature of the issue. 

Suppose that an economy has two outputs, manufactured energy 
and bread, and two primary resources, energy and labor; that bread 
can be produced by either of two processes, one labor intensive and 
one energy intensive; and that both processes are in fact currently in 
use. Then, other things being equal, if employment is constant, every 
labor hour used in the manufacture of energy increases the amount 
of bread that must be produced by the energy-intensive rather than 
the labor-intensive process. The corresponding increase in energy use 


sponding to the depreciation row which was equal to the percentage distribution of 
gross private capital formation over its components. The second adjustment was to add 
some measure of the energy content of the (intermediate) imports used in production. 
Since we are not really concerned with the foreign energy content of the imports but of 
the domestic energy opportunity cost, the relevant measure is the energy content of the 
exports traded in exchange for these imports. To calculate this, we added a vector of 
export coefficients corresponding to the import coefficient row. This procedure im- 
plicitly values the energy content of the imports at the energy content of a correspond- 
ing dollar’s worth of exports. 
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is the energy-opportunity cost of the hour of labor.* Obviously, this 
should be included in a full calculation of net energy. If the man- 
ufacture of 1,000 Btu of energy uses up 900 Btu directly and indi- 
rectly and requires labor time which would instead have been em- 
ployed as a substitute for another 200 Btu, the process in question 
clearly depletes society’s stock of energy on balance. A calculation 
which omits this opportunity cost is simply incorrect and misleading. 


II. Profit as an Indicator of Energy Opportunity Cost 


Direct estimates of the magnitudes of such opportunity costs may not 
be easy to obtain, but we will show that the profits of the energy 
manufacturing firm can tell us a great deal about the magnitude of 
the energy opportunity cost of its operations. Let the economy’s 
production function for manufactured energy satisfy 


TRG ee at tare Fins Tis > Ta) = 05 e) 


where y, is the quantity of energy output, 7, ; is the quantity of primary 
input 7 used in energy production, and r; is the quantity of primary 
resource j used as input, directly and/or indirectly in all other outputs. 
Let r,, and7, represent the quantities of energy input, for the moment 
taking all energy to be homogeneous.’ 


6 The reality and significance of the indirect energy cost of energy production and of 
the energy opportunity cost of its other inputs is underscored by recent newspaper 
accounts. A fall in demand for nuclear electricity has led to a decrease in production of 
enriched uranium sufficient, according to the editor of Electrical World magazine, to 
reduce electricity use in the enrichment process by an amount equal to | percent of U.S. 
electricity consumption (New York Times 1980). And a page-1 story (Shabecoff 1980) 
indicates that rising energy prices are leading to a long-term shift toward the use of 
more labor in place of energy (e.g., use of additional workers to insulate homes and 
factories and even the hiring of more cowboys to drive cattle overland rather than 
transport them by truck), and it reports an estimate by Dale Jorgenson and E. A. 
Hudson which suggests that a doubling of energy prices can increase employment by 5 
percent. Although some may think that these estimates exaggerate the magnitudes of 
the indirect energy costs and the energy opportunity costs, they clearly show that these 
are real and tangible phenomena which can be expected to become increasingly obvious 
if energy prices continue to rise. 

7 Here, the y,’s refer to net output—that is, final output—instead of total output. 
Thus the vector of final demand is 


vi 


Ym 


An intermediate output will, essentially, be valued in terms of its primary-resource 
content. More specifically, using a standard input-output structure, define 
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Then the production of energy will be profitable if* 
Pim 2 LP imi» (2) 


where p, is the price of both the energy output and the energy input 


to be the n by m coefficient matrix, showing the amount of each primary input 1 
through 7 required per unit of output for each of the m sectors. We will let the first row 
of q contain the direct energy inputs in each of the sectors per unit of output. 


Therefore, 
itd. 
qq | qai tem) 


Gni--+4Ynm 


The sum of the direct and indirect primary inputs required per unit of final demand is 
then given by matrix s: 


CiterisrO ni 
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where the first row of s indicates the direct plus indirect requirements of the primary 
energy source. Then, for a given vector of final demand, y, the total primary input 
requirements are given by vector w: 


Vie Joo... amp 
S21 + YoSoo + .-~ + YmSom 


w= 
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We can use the expressions 


Ti1 = V1e1 T= Volo te a. te mlm 
Tio = VSo1 Y2 = YoSeo +... + VmSom 
. and 

Tin = ViSni Tn = YSn2e + - ++ + VmSnm 


to define the symbols used in eq. (1). Note, moreover, that the same calculation can be 
reperformed using several alternative techniques for each of the sectors—i.e., alterna- 
tive column values for a and q. This can yield a continuous production function F in the 
limit. 

* Here it is assumed that energy is produced in a process that is completely separate 
from other outputs so that 7;, can be measured unambiguously because there are no 
common input costs. 
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used by it.° Here and throughout the remainder of the paper, we will 
take p, to be a constant; that is, we will assume that the United States is 
a price taker on the world energy market. This permits us to avoid the 
complications of analysis of strategic price behavior that the United 
States might want to consider. However, a bit will be said on this 
subject later. 

Assume that input prices either are given exogenously or are for 
some other reason beyond the control of the firms that use them. 
Moreover, whether or not input-using firms are competitive, assume 
that all, or at least a substantial number, of them are cost minimizers, 
that is, that they seek to produce whatever output vector they select at 
minimum total cost. Then, as is well known, they will purchase those 
input quantities that satisfy 


pilp: => Tiga, = 07,,/07;; = Or,/0r;, (3) 


where F,; = 0F/0r,;, and so forth. Consequently, the profitability 
criterion, (2), can be rewritten as 


n n 
v1 Ea Irae + ee eta + S r8rnlOr; 
j=2 j=2 


(4) 


n 
14 ae Dy 7,;07,/07;. 


j=2 


III. Tendency of Total Profit to Underestimate Total Net Energy 
Yield 


The equivalence of (2) and (4) immediately suggests the relation 
between unprofitability and net energy loss, for 


1 ;07,,/07, ; = 7, ;07,/07; (5) 


appears to be an indicator of the energy opportunity cost of input j 
used in the energy manufacturing process: It is the quantity of input j 
used multiplied by the marginal quantity of energy input for which a 
unit of j can be substituted, ceteris paribus; that is, it is r,; multiplied 
by the marginal rate of substitution of 7 for energy. Unfortunately, (5) 


® One should note that here a crucial feature of energy production enters, namely, the 
fact that both the output and one of the inputs are really the same commodity. This is 
what permits us to assign the same price, p,, to both the energy input, 7,,, and the 
energy output, y;. This is rather reminiscent of Quesnay’s Tableau économique, where the 
output of agriculture was precisely the same product as its input—corn. The fact that 
energy is both an input and the output in this process permits us to divide (2) through 
by the product price, p,, to obtain a ratio of input prices, p j/p,, which can be interpreted 
with the aid of efficiency condition (3). For noneconomist readers we may note that the 
standard relation (3) is obtained directly by minimization of total cost, 2p, ;, subject to 
the production function constraint F(-) = 0. 


. 
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is not the opportunity cost we are seeking, and we will show now that 
the correct figure is likely to be lower than (5) in many plausible 
circumstances. 

The figure we really want corresponds to the saving in energy 
which would be permitted not by a small reduction in the use of 7 in 
activity 1 but by total elimination of the use of j for this purpose. To 
permit us to describe the issue with the aid of a two-dimensional 
diagram first, assume there are only two inputs, energy, 7, and 
another input, r.. The upper part of figure 1 represents the man- 
ufacture of all goods other than energy, and yy’ is the isoproduct locus 
corresponding to the initial equilibrium, with point E representing 
the initial input combination. 

Immediately below point E we have drawn the 7; axis of a graph 
representing the manufacture of energy, with E, its initial input 
vector, so that 74. is the initial quantity of input 2 used in the process. 
If all firms are cost minimizers, dr,/dr., the slope of yy’ at E, will equal 
dr,,/dr,, the slope of yyy; at E;. 

Transfer of all of 7,. from production of energy to production of 
other goods, holding the output of the latter constant, will move the 
economy from point E to E’ on curve yy’ in the upper figure, thereby 
reducing the use of input 1 by SE’ = Ar, = ry2Ar,/Ar,. This is the true 
energy opportunity cost of the use of input 2 in production of energy. 

In general terms, to determine the energy opportunity cost of input 
j, we must take the quantity of this input used in energy production to 
fall from 7,; all the way to zero. Simultaneously, we must take that 
amount of input/, Ar; = 7,;, to be transferred into activities elsewhere 
in the economy and determine the quantity of energy that Ar; can 
replace in those other industries, holding all those outputs constant. 
The opportunity-cost figure we are seeking, then, is 7,;Ar,/Ar;, the 
decrement in use of energy permitted by the incremental use of Ar; = 
r,. elsewhere in the economy. Thus our net energy criterion becomes, 
instead of (4), 


n 
Vee > r,;Ar,/Ar;. (6) 
j=2 
This relationship constitutes our definition of the net energy yield of 
an activity: It measures whether the energy output of the activity is 
greater than the direct and indirect use of energy inputs plus the sum 
of the energy opportunity costs of all other inputs used in energy 
production if all other activities in the economy remain constant. 
Let us now see the relation of net energy criterion (6) to the 
profitability criterion (4) and, in particular, to the terms 7,;07r,/07; 
which it employs. In figure 1 draw the tangent ET to yy’ at E. Then, 
clearly, 


7407,/0r2 = ST > SE' = 1,,Ar,/Aro. (7) 
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Fic. 1.—Calculation of an input’s potential energy opportunity cost 


This inequality will hold whenever the production surface is quasi- 
concave so that the isoproduct curve will be convex to the origin, as is 
yy’. For then, with the slope of yy’ diminishing in absolute value as one 
moves to the right, Ar,/Ar, must always be smaller, absolutely, than 
dr,/dr,. Intuitively, this is so because as more and more 7, is used to 
replace energy in general production, one can be expected to be left 
with energy uses which become ever increasingly difficult to replace, 
for efficiency in the substitution process requires the easy energy- 
replacement opportunities to be used up first. 

Thus, where (as usually assumed) the isoproduct curve has the 
shape of yy’ the relationship equivalent to (7) can be expected to hold 
for all inputs, 7. It then follows that the RHS (right-hand side) of (6) 
can generally be expected to be smaller than that of (4). Since (4) is the 
test of profitability, while (6) is the test of net energy yield, it follows 
that the latter may frequently be easier to pass than the former. That 
is, a new energy manufacturing process may be unprofitable and yet, 
nevertheless, yield a positive net energy output. However, the dif- 
ference can be expected to be small. The reason is that what is at issue 
is the difference between marginal and average opportunity cost, not 
in the energy-producing industry but in the remainder of the econ- 
omy taken as a whole. Since the quantities of input involved are likely 
to be small relative to the economy’s total use of these inputs, the 
range of the isoproduct hypersurface traversed can be expected to be 
negligible. Thus the change in slope and, consequently, the difference 
between (4) and (6) should normally be small.!° 


10 It must be reemphasized that the calculation is all predicated on the assumption 
that all outputs other than domestic energy production remain constant. It also ignores 
any price effects that can be expected to follow from a large change in an energy- 
producing activity. 
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We may then arrive at: 

Proposition 1: While unprofitability does constitute grounds for suspi- 
cion that a new energy process will be a net energy loser, it cannot be 
taken necessarily to reject that process on net-energy grounds. 

However, even this guarded conclusion suggests a judgment more 
sanguine than the situation warrants, as will be shown in the next 
section. 


IV. Marginal Unprofitability Implies Negative Marginal Yield of 
Net Energy 


We will show now: 

Proposition 2: If firms in the economy minimize the cost of produc- 
ing their output vectors, a marginal increase in energy output from 
any process that has a negative profit yield must also have a negative 
potential marginal yield of net energy when other outputs are held 
constant. 

This conclusion has policy implications which are extremely dis- 
turbing: 

Corollary: If a firm that invests in an energy-producing process can 
capture all of its profit yield (1.e., if there are no externalities or similar 
market failures), any expansion in the manufacture of energy (be- 
yond the quantity that maximizes profits) that is induced by subsidy 
unaccompanied by a specially designed program of taxes on energy 
use elsewhere in the economy must be excessive in the sense that (at 
least) a small reduction in that energy output can increase the economy’s 
energy stocks if all other activities in the economy are unaffected.!! 

Put alternatively, since such an expansion in the energy-producing 
process may well lead indirectly to a reduction in other activities in the 
economy, some energy may well be saved as a consequence. But any 
saving in energy would have been greater if those other activities had 
been reduced in the same way without the expansion in the energy- 
production process that is subsidized. 

In other words, whenever a subsidy to an energy-producing activity 
leads to its expansion beyond the level to which it would be brought by 
market forces, it can be expected to have a negative (marginal) net 
energy yield in this sense. That is, under the circumstances postulated 
any subsidy may make matters worse, not just in cost-benefit terms, as 
we might normally expect such a subsidy to do, but in terms of 
physical energy supplies. 


“This assumed implicitly that the profit surface is concave (hill shaped) and that in 
the absence of subsidy the firm will operate at the point of maximum profit or beyond it 
(as it will, e.g., under sales maximization). Then the subsidy-free output must corre- 


spond to a downward-sloping portion of the relevant cross section of the profit hyper- 
surface, and the result applies. 
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The proof of this disturbing proposition is straightforward. Nega- 


tive marginal profitability (on our parametric price assumption) is 
described by” 


pidy, < Xpjdry;, (8) 
but by (3), (8) can be rewritten as 
dy, < dry + > dryjOry/Or;, (9) 
j=2 


which tells us that the marginal net energy yield of output y, is 
negative. Q.E.D. 

One remarkable feature of this proof is that it does not rest on a 
restrictive premise such as the presence of perfect competition. The 
key assumption which underlies (3) is that firms minimize the total 
cost of whatever output they produce. But this holds for a wide 
variety of market forms—for example, profit-maximizing oligopolists, 
monopolists or monopolistic competitors, and firms which maximize 
total revenue (sales)!? or growth rather than profits.'4 

Nor is it necessary that this premise be satisfied throughout the 
economy, as long as it holds for the sectors where the inputs will go if 
they are not used in energy production, and, in particular, (3) does 
not have to be satisfied in the energy-producing sector itself. That is, 
we do not have to assume that the economy is initially operating 
efficiently, for the opportunity cost, as we have seen, is calculated not 
in the energy-producing sector but in the remainder of the economy 
where input) can be substituted for energy use. Thus, if somewhere 
else in the economy least-cost production occurs and satisfies p/p, = 
Or ,/Or;, a trade-off on the terms indicated is indeed possible.*® And if 


2 This assumes that energy price, p,, is determined on the world market and is 
therefore uninfluenced by the output of the domestic industry that manufactures 
energy. If this is not so, Ra = p,(dy, — dr,,) should be replaced by R, = d(p,y, — pir) = 
(¥1 — 711)dp1 + pi(dy; — dr,,). With a negatively sloping demand curve, when dy, > 0 then 
R, <R,. This means that when marginal profit is zero, the marginal net energy yield will 
be positive so that some subsidy to the output of a profit-maximizing producer of energy 
will in this case contribute to the economy’s energy stocks. However, since it seems 
unlikely that the magnitude of the output of any one producer of synthetic fuels, e.g., 
will be sufficient to have a significant effect on international oil prices, this point does 
not seem particularly important for policy. 

13 However, our result does have special implications for firms with objectives such as 
sales maximization which drive them beyond the point of maximum profits even in the 
absence of subsidy. If energy production is in the hands of such a firm, our result shows 
that even without subsidy, its energy output will be carried to a point where its marginal 
net energy yield is negative, so that a restriction in its output will actually increase the 
economy’s energy stocks. Thus a tax on such an activity, rather than just a zero subsidy, 
is called for, at least in theory. 

14 Thus this differs from the usual conclusion that subsidy will reduce welfare in that 
our current conclusion is expressed in physical rather than psychological (utility) terms 
and, unlike the usual result, does not require universal perfect competition. 

Thus the neoclassical assumption of substitutability at the margin, as implied in 
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this relationship holds in those sectors where the resources will actu- 
ally be used, such a trade-off will in fact occur. 


V. On the Reality of the Energy Opportunity Costs 


We must be careful not to exaggerate the strength of our result. An 
energy opportunity cost clearly does represent a potential energy loss, 
and in that sense an expansion of an energy-production process does 
represent an energy loss relative to what would have been achieved if 
its resources had instead been used in an optimal program to increase 
energy stocks. But in reality, if the energy-producing activity were 
reduced from y,;, its magnitude under a subsidy, to y,,, its profit- 
maximizing level, the inputs released as a result might very well not be 
used in a way that maximizes their contribution to the energy pro- 
gram. First, they might not be used efficiently and, second, they 
would probably be used in part for other purposes. They would be 
used in an efficient manner if (3) were to hold everywhere at either 
level of y,, that is, if all firms in the economy were to produce 
whatever output vectors they choose to supply at the minimum cost 
for that vector. But if that is not so, the actual energy loss resulting 
from the increased use of other inputs needed to raise y, to y;, may 
well be less than our calculations indicate. In addition, whether or not 
they were reallocated efficiently, in practice, only some portion of 
those reallocated resources would be used as a substitute for oil. That 
is why the corollary to proposition 2 has been put rather guardedly. 

Nevertheless, those losses will almost certainly be real (and very 
possibly substantial, as is suggested by the material reported in n. 6 
above). That is, if the energy-producing activity is in fact reduced and 
the corresponding input quantities are thrown on the market, the 
forces of supply and demand can be expected to cause a readjustment 
in relative input prices which induce the employment of at least some 
of those inputs elsewhere in the economy. Those other inputs will then 
in fact serve (in part) as energy substitutes. Thus the marginal energy 
opportunity cost of the labor, materials, and other primary inputs 
used in energy production will represent some actual loss in energy 
stocks and not a remote possibility of substitution which is perhaps 
unlikely to occur.’® Interpreted in this sense, our propositions no 


p/p, = 9r,/dr;, need not hold throughout the economy. For the weaker interpretation 
of our result (see next section) to be valid, all that is required is that somewhere in the 
economy the resources dr,, absorbed by the energy sector when its output is increased 
by dy,, be substitutable for energy. It seems hard to imagine that this weak premise will 
ever fail to be satisfied. 

'6 See n. 2 above. We may inquire briefly about the implications of violations of some 
of the premises underlying proposition 2. In particular, we will consider (a) the effects 
of the presence of specialized inputs in the energy-using industry which are not usable 
elsewhere, (b) monopsony power in the input-using industry, and (c) cases where 
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longer guarantee that a subsidy will always cause an incremental net 
energy loss, but they nevertheless leave it under a dark shadow of 
suspicion. 

We can put the matter another way by recognizing explicitly that 
there are two different opportunity-cost concepts, which we may refer 
to as the potential and the actual opportunity losses. In our case, the 
former is the maximal amount of petroleum that can be saved by 
substitution of some quantity of another resource, Ar;, while the latter 
refers to the amount of oil that will actually be saved by the use to 
which Ar; will in fact be put when it is released from a subsidized 
activity. Our formal propositions evaluate the Opportunity cost in 
terms of potential opportunity loss, but there is good reason for 
policymakers to be concerned with the actual opportunity loss, whose 
magnitude, while undoubtedly substantial, is certainly lower than the 
potential-loss figure. 

We can now use a diagram to suggest another way of looking at the 
analysis, which may help to remove any air of sleight of hand that may 
characterize our results.’ In figure 2, two production-possibility 
frontiers are represented for the economy, showing the variables, net 
energy outputs and other outputs, on the axes. Here YY° represents 
the production-possibility frontier relevant without the subsidy for 
the energy supply technique, while Y'Y” represents the production- 
possibility frontier relevant with the energy supply subsidy. This latter 
frontier is drawn below and to the left of the former since the subsidy 
reduces economic efficiency. Here the reduction in economic effi- 
ciency can be approximated by the following integral: 


Ey 
J S(E)dE, 
Eo 


where S(E) represents the subsidy offered per unit of supply, E 
represents the quantity of energy produced using the technique in 


management derives utility from the hiring of a quantity of input j greater than that 
which minimizes cost. The first of these works to undermine proposition 2, since such 
specialized inputs enter the incremental profit expression as a money cost though they 
incur no energy opportunity cost. Thus they may cause failure of the profit test even 
when the net energy test is passed. However, the next type of violation of our premises 
can be expected to work the other way, strengthening proposition 2. Monopsony power 
over input 7 means that in equilibrium 0r,/dr; = mc,/p, > pj/p,, since we expect the 
supply price of 7 to be rising (here mc; is the marginal cost of purchases of j to the firm 
that uses it as an input). Hence the true opportunity cost of j, 0r,/0r;, will be greater 
than p,/p,, the corresponding figure in the profit equation. Finally, excessive purchases 
of j, because they provide utility to management, need not affect our result. With a 
given stock of resource it is impossible for this to be a universal phenomenon. But if it 
occurs only in a limited sector of the economy, elsewhere j will become scarcer, its 
relative marginal yield will rise, but its price will rise correspondingly, and p/p, = 
Or,/dr ; will continue to be satisfied by firms in the remainder of the economy (i.e., those 
which do not buy excessive amounts of /). 

7 We are grateful to James Sweeney and Robert Willig for suggesting this way of 
looking at the matter. The next several paragraphs follow a draft by Sweeney. 
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Net Energy Outputs 





Other Outputs 


Fic. 2.—Production-possibility frontiers with and without energy supply subsidy 


question, Ey represents its initial output, and E, represents the output 
under the subsidy. In figure 2, A represents the initial output combi- 
nation in the absence of any subsidy, and B represents the output 
combination with the subsidy. These points have been drawn under 
the assumption that the subsidy in fact increases net energy outputs 
and decreases the outputs of other goods and services. Point C repre- 
sents a combination of net energy outputs and other outputs available 
if a perfectly efficient mechanism, such as a tax on the imports of all 
energy, had been instituted. The difference in net energy outputs 
between C and B is what we call the potential net energy loss asso- 
ciated with this subsidy. Note that, although the subsidy program can 
increase the actual net energy output, there is a loss in potential net 
energy output relative to what could be obtained with a perfectly 
efficient program. 

Alternatively, the distance from 4 to D can be viewed as the poten- 
tial net energy output loss associated with a subsidy program. This is 
the net energy loss if the economy is managed in such a way that all 
other outputs remain constant. 

Thus we see that even though a subsidy program may increase 
actual net energy output, it can be expected to lead to a decrease in 
potential net energy output. 


VI. Promise of Future Profitability: A Dangerous Fallacy 


Before we turn to our overall conclusion, it is important to deal with a 
critical error in reasoning which the preceding analysis may well elicit. 
Undoubtedly it will suggest the response that many new energy 
sources, while admittedly unprofitable today (both at the margin and 
overall), should become viable in the foreseeable future if enough R & 
D is devoted to the task. This is probably true, but it weakens neither 
the analysis nor its policy implications, because the issue is not 
whether they will ultimately begin to yield positive profits, but 
whether those profits can be expected to be sufficient to compensate 
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for the initial losses. To drive home the point, consider each of the 
two possibilities explicitly. 

Case a: If future profits are insufficient to make up for earlier losses 
at the margin, our analysis still shows that over its lifetime the invest- 
ment will be a potential net energy loser. It will yield some net energy 
gains in later years, but not enough to make up for the resource 
depletion it caused earlier. To see how this is implied by our previous 
analysis, we need merely interpret the quantities in our equations to 
represent discounted present values of the corresponding future 
streams, and the result follows at once: The present value of the 
stream of future profits serves as the required indicator of the present 
value of the stream of values of the future net energy yields, and 
propositions 1 and 2 continue to apply. 

Case b: Where expected future profits do make up for earlier losses, 
the corresponding net-energy tests will, of course, be passed. But in 
the absence of capital market imperfections and externality problems, 
there should then be no need for government subsidies. Private 
enterprises should, in these circumstances, be expected to invest ade- 
quately in the project. After all, any firm that invests in an innovation 
normally expects to lose money initially and to make it up later. There 
is simply nothing unusual in such cases. 

Thus we conclude that the likelihood that with sufficient research 
outlays a new energy source can be made viable financially at some 
point in the future does not weaken the presumption that subsidies to 
such projects will lead to a net loss in the economy’s potential net 
energy stocks. 

Here it should be noted, incidentally, that the research process itself 
has its energy costs—direct, indirect, and those that take the form of 
energy opportunity costs. (One fusion-research project, e.g., is re- 
ported to use as much electricity as a medium-sized American city.) It 
may also be appropriate at this point to take note of another poten- 
ually misleading argument on our subject. It is tempting to assert that, 
because the energy issue is not exclusively a matter of growing scarcity 
of a resource but involves freedom of the nation’s foreign policy from 
the threat of international blackmail, it may well pay society to invest 
in energy-production projects to an extent that is unprofitable for the 
private sector. By itself, this assertion is legitimate enough, but un- 
fortunately there is no way such investments will help to deal with the 
international blackmail problem if the projects undertaken are net 
energy losers. In short, we are by no means advocating adherence to a 
profitability criterion for its own sake. We are merely suggesting that 


* It can, however, be argued (and, indeed, it has been) that it is inappropriate to 
discount a flow of benefits to future generations. This is not the place to go into the 
complexities of this difficult issue—the proper standards of justice in intergenerational 
distribution. 


go08 JOURNAL OF POLITICAL ECONOMY 


profit can often serve as a useful indicator of the net energy contribu- 
tion promised by such an undertaking. If its net energy yield over its 
lifetime will be negative, there is no way it can be justified on 
geopolitical grounds any more than it can be rationalized in economic 
terms. 


VII. Heterogeneity of Energy Sources 


There is another assumption underlying our analysis which can easily 
lead to a second erroneous conclusion on policy. We have assumed all 
forms of energy to be homogeneous, but in fact they are not. Coal is 
very different from oil in terms of the uses to which it can easily be 
put, in transportability, in terms of environmental effects, and so 
forth. Such differences may sometimes be able to justify the adoption 
of projects which are (marginal) net energy losers. Suppose one type 
of energy (petroleum) is in short supply while another (coal) is very 
abundant. An effective process to transform coal into oil may then be 
worthwhile even though the nation’s inventory of British thermal 
units is reduced in the process. 

Heterogeneity may take yet another form—that of differences in 
the dates at which different energy supplies become available. Sup- 
pose we were to believe that within a century, fusion will supply all our 
energy needs safely, abundantly, and cheaply. The energy supplies 
which will not be used before the twenty-second century may become 
valueless or nearly so. If this is, for example, true of much of our coal 
reserves, a net energy loss involved in the trade of coal tomorrow for 
liquid fuel today may well be justifiable. 

But the conclusion is emphatically not that the net energy consider- 
ations emphasized in this paper can be safely ignored. Indeed, it is 
remarkable that the robustness of our basic theorem (proposition 2) 
extends it to cover the issue of heterogeneity of energy sources. That 
is, we will see now that in the absence of externalities, a subsidy to a 
particular form of energy (such as current output of petroleum 
from coal) must actually reduce the potential net output of that form 
of energy (it must reduce net petroleum stocks even if other forms of 
energy are used as an input in the process). In showing this result, we 
can easily deal with an economy having any arbitrary number of 
forms of energy. But the logic of the argument as well as the nature of 
the formal analysis can be brought out most clearly by assuming that 
there are only two forms of energy, one of which (call it “petroleum”) 
is directly substitutable for imported energy, while the other (call it 
“coal”) is not directly substitutable for imported fuel and is available in 
large amounts which may never be used up.!® We then have: 


*® Obviously, proposition 3 will not hold if coal is a homogeneous and totally free 
economic good which can be substituted for petroleum without cost. But coal does cost 
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Proposition 3: Under the conditions of proposition 2, the expansion 
of any petroleum-producing activity which involves a negative margi- 
nal profit yield will involve a net reduction in petroleum stocks at the 
margin, even if substantial amounts of coal are employed in the 
process, if output of coal and other outputs are held constant. 

That is, such an unprofitable increase in petroleum output may well 
exacerbate the economy’s energy importation problem, involving in- 
creased use of coal as a substitute for petroleum! 

At first glance this result may be even more implausible than propo- 
sition 2 itself, but its proof is simply a matter of trivial reinterpretation 
of the steps of the proof of proposition 2. For this purpose we need 
merely consider y, to be manufactured petroleum (e.g., oil extracted 
from shale or from coal), 7, and r,, to be the inputs of petroleum alone 
(rather than total energy input), and 7, and 1, to be coal inputs. Then 
(10) immediately constitutes our result, for it tells us about the net 
marginal yield of commodity | (petroleum) alone, regardless of the 
quantities of input 2 (coal) or other resources used to produce it. 

One may well argue vis-a-vis this proposition that it is unreasonable 
to assume that a subsidy, say, to conversion of coal to oil will leave 
other outputs unchanged, for it will surely affect at least the gross 
output of coal and perhaps the gross outputs of other energy sources. 
Nevertheless, our interpretation of proposition 2 in relation to this 
issue remains valid here. That is, the petroleum saved by a reduction 
in other activities in the economy may offset the oil lost by the 
expansion of the subsidized activity, but in that case the gain in oil 
inventories would have been still greater if those other outputs had 
contracted in the same way without the subsidy-induced increase in 
conversion of coal into oil. 

However, in this case the behavior of substitute energy sources is 
actually apt to exacerbate the problem rather than relieve it: (a) The 
output of other forms of energy used as inputs in the subsidized 
process (the coal in our example) will expand and therefore increase 
their own use of oil as an input, thus tending to reduce the overall 
petroleum yield;”° (b) the outputs of other forms of energy (say, 
electricity) may be reduced because subsidized oil becomes somewhat 
cheaper, but if this is so it will only occur because oil consumption is 
increased and substituted for other types of energy. On both scores, 


money to extract, and it varies in quality and ease of extraction. Hence, it has a nonzero 
price per Btu which can be expected to rise as coal stocks are used up. 

20 It is, of course, true that in this case the available quantity of coal may be increased, 
but there are two reasons why it will not help to solve the problem: First, by hypothesis 
“coal” is used here to represent an input which, unlike oil, is not in short supply. 
Otherwise, there is nothing gained by sacrificing 10 Btu of coal for 5 Btu of oil, the issue 
we are now discussing. Second, by proposition 2, the subsidy must still have a negative 
potential net energy yield of coal and oil together as well as in oil alone (proposition 3). 
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then, the net energy loss in the subsidized production of oil is likely to 
be aggravated by the response in output of other types of energy. 


VIII. Oil Tariff as Stimulus to Supply 


We began our discussion by contrasting the net energy consequences 
of a targeted subsidy program with those of a tariff on imported oil. It 
is time to turn to the latter. Obviously, a tariff will tend to reduce oil 
use by both consumers and industry as well as stimulate energy 
supplies from both conventional and innovative sources. 

Standard economic analysis also suggests that such a tariff will 
accomplish all this in an efficient manner.”! If the magnitude of the 
tariff corresponds to the size of the marginal detrimental externality 
represented by increased dependence on OPEC, in a competitive 
world it will move the economy to a Pareto optimal point on the 
efficiency frontier. This point will involve larger oil stocks and smaller 
outputs of at least some other commodities than would otherwise have 
been produced. All this would, of course, be accompanied by some 
reduction in the consumption of those other goods by households and 
other consumers. 

Nevertheless, our analysis may well appear to have shown that this 
benefit of the tariff is likely to be offset in part by its supply conse- 
quences. To the extent that it serves, in effect, as a subsidy to domestic 
production, it seems likely to lead to changes in output whose net 
energy yields are negative, with the tariff protecting producers from 
the reductions in profit that would otherwise follow. This view of the 
matter is, however, incorrect. Since a tariff raises the cost of imported 
energy everywhere, it will not only increase the gross output of 
domestic production of energy in export-substitutable forms; it will 
also cause substitution of other inputs for use of such types of energy 
elsewhere in the economy. This will cause adjustments in the energy 
opportunity costs of those inputs, which will mean that any profitable 
expansion in domestic energy production resulting from the tariff 
will in fact have a positive net energy yield. This is because the 
increased substitution, throughout the economy, of other inputs for 
oil will cause the marginal rate of substitution (MRS) to fall; that is, an 
additional unit of such an input will be able to replace less oil than it 
could before. In this way the MRS between oil and any other input 
elsewhere in the economy will remain equal to its marginal product in 
innovative petroleum-producing activities. 

In terms of figure 2 the point is that the tariff will move the 
economy from a point such as A to one such as C, with an increase in 
oil output and a loss in efficiency, while the subsidy will, instead, move 
us to a point such as B, causing the potential net energy loss BC. 


*1 For an excellent discussion along these lines, see Schmalensee (1979). 
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Somewhat more formally, we see that the pretariff output of an 
innovative energy source will be smaller than the amount that 
maximizes profits after the imposition of the tariff. As we have seen, 
an increase in output that offers a positive marginal profit must also 
have positive marginal net yield of energy. It follows that the net 
energy yield of the expansion in output of manufactured energy after 
imposition of the tariff must be positive. 

Nevertheless, there are some reservations which must be expressed 
about the tariff approach. First, if a tariff induces an increase in 
consumption from domestic sources, it may simply hasten the process 
of depletion of our oil inventories and speed the economy’s growth in 
vulnerability to the cartel! 

One can also think of a case in which subsidy of a net energy losing 
project with high sunk costs (e.g., high research costs but low variable 
costs) may conceivably be superior to a tariff. If, once the resources 
are expended, it constitutes a credible threat which prevents the cartel 
from raising its price in the future, it may be worth the outlay. It may 
be superior to a tariff which will also lead to domestic investment 
outlays on energy production and, in addition, result in higher prices 
to consumers.”” 


IX. Concluding Comment: Subsidies Which Do Increase Energy 
Supplies 


We began by denying that we were opposed to all programs involving 
subsidies to new energy sources. Our objective, rather, is to make 
absolutely clear how easily an ill-considered subsidy program can 
backfire and may actually serve to deplete the economy’s energy 
supplies. But we may now seem to have proved too much—to have 
shown that any subsidy can be presumed to cause energy losses. In 
fact, we have not done so. Rather, properly interpreted, our results 
indicate the conditions under which such outlays can contribute 
energy resources. This is made possible by the role of externalities 
that may produce cases in which a project’s net energy yield offers no 
corresponding net profits to the firm that undertakes it. 

There is no point in attempting to provide an exhaustive list of the 
ways in which such sources of market failure may arise. We will only 
mention two which, we suspect, are of prime importance here and 
which should illustrate the sort of projects for which subsidies can be 
justified. The two externalities which will be discussed are those re- 
sulting from differences between private and social risk and those 
which arise from the free-rider problems engendered by investment 


in research. 


22 We are indebted to Robert Willig for pointing out this possibility to us. 
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There are at least two reasons for which the social risks of an 
investment may be smaller than those incurred by a private firm 
which undertakes it, so that the investment may be desirable for 
society even though it is unattractive to private enterprises. The first is 
the well-known argument of Arrow and others which points out that 
since at any given time the economy contains a substantial number of 
investments which incur risks that are to a considerable degree inde- 
pendent, the “law of large numbers” reduces the risk to society of an 
additional project below what it is to a private entity which has a 
considerably smaller number of projects under its control. 

Second, financial risk is likely to be more threatening to the indi- 
vidual owner of an enterprise than it is to society. If the project goes 
bankrupt but continues to operate in receivership, the investment is 
by no means lost to the community, though it is lost completely to the 
initial investor. So, on these grounds, too, new energy projects, which 
are indeed risky, may incur social costs and, hence, energy costs lower 
than their energy yield even if they are unattractive to private inves- 
tors. 

In practice, perhaps the most important justification for subsidy to 
particular energy production projects is the free-rider problem which 
accompanies all applied research. Research almost always generates 
information which is useful to others besides those who have con- 
ducted or financed it, and there is often no way to make those others 
bear an appropriate share of the cost. Thus, taking all beneficiaries 
together, the total yield of the research may well exceed its costs, and 
our analysis can be shown to suggest that an energy project which 
does so can be presumed to have a positive net energy yield.2? Never- 
theless, because some of the beneficiaries do not make a correspond- 
ing financial contribution toward the cost of the research, such a 
project may well be unattractive to any private entrepreneur. 

The patent system is, of course, intended to deal with this free-rider 
problem, and to some extent it does. However, some research results, 
particularly those that qualify as products of “basic research,” are not 
patentable. Others are early steps in a development process which will 
occupy many years—so many that the period of patent protection is 
exceeded. And even results which are eligible for patent protection 
may nevertheless throw off substantial indirect benefits to free riders 
who will be helped by them to produce substitutes for the patented 
item. This is frequently suggested as the explanation for the fact that 


** This is fairly obvious if the free riders use their unpaid-for knowledge to produce 
more energy; their energy output must be added to that of those who conducted the 
research. However, the argument applies even if the free riders use the knowledge in 
other economic activities, since the product of those activities will also have an energy 
opportunity value. For example, if the new knowledge reduces the inputs needed to 
produce a unit of some other product, the inputs saved can be used as a substitute for 
energy elsewhere in the economy, in the manner already discussed. 
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many firms choose not to patent the results of their R & D activities, 
thereby avoiding the disclosure required for a patent application. 

These are probably the prime reasons no private firms will under- 
take the research outlays required for a protracted and risky de- 
velopment process such as that required for very innovative energy 
sources like nuclear fusion, which is generally not expected to yield a 
financially viable product until we are well into the twenty-first cen- 
tury. 

In conclusion, let us emphasize again that even if a benign energy 
source such as solar energy does not pass the test that qualifies it for 
subsidies under our criterion, it may nevertheless be socially desirable 
to encourage its production. But a tariff on oil importation or a tax on 
the consumption of oil would appear to be the appropriate way to do 
so. 
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A Rational Theory of the Size of Government 
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In a general equilibrium model of a labor economy, the size of 
government, measured by the share of income redistributed, is de- 
termined by majority rule. Voters rationally anticipate the disincen- 
tive effects of taxation on the labor-leisure choices of their fellow 
citizens and take the effect into account when voting. The share of 
earned income redistributed depends on the voting rule and on the 
distribution of productivity in the economy. Under majority rule, the 
equilibrium tax share balances the budget and pays for the voters’ 
choices. The principal reasons for increased size of government 
implied by the model are extensions of the franchise that change the 
position of the decisive voter in the income distribution and changes 
in relative productivity. An increase in mean income relative to the 
income of the decisive voter increases the size of government. 


I. Introduction 


The share of income allocated by government differs from country to 
country, but the share has increased in all countries of western 
Europe and North America during the past 25 years (Nutter 1978). 
In the United States, in Britain, and perhaps elsewhere, the rise in tax 
payments relative to income has persisted for more than a century 
(Peacock and Wiseman 1961; Meltzer and Richard 1978). There is, as 


We are indebted to Karl Brunner, Dennis Epple, Peter Ordeshook, and Tom Romer 
for many helpful discussions and to the participants in the Carnegie-Mellon Public 
Economics Workshop, an anonymous referee, the editor, and the Interlaken Seminar 
for constructive comments on an earlier version. 
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yet, no generally accepted explanation of the increase and no single 
accepted measure of the size of government. 

In this paper, the budget is balanced.! We use the share of income 
redistributed by government, in cash and in services, as our measure 
of the relative size of government and develop a theory in which the 
government's share is set by the rational choices of utility-maximizing 
individuals who are fully informed about the state of the economy 
and the consequences of taxation and income redistribution.2 

The issues we address have a long intellectual history. Wicksell 
(1958) joined the theory of taxation to the theory of individual choice. 
His conclusion, that individual maximization requires government 
spending and taxes to be set by unanimous consent, reflects the 
absence of a mechanism for grouping individual choices to reach a 
collective decision. Following Downs (1957), economists turned their 
attention to the determination of an equilibrium choice of public 
goods, redistribution, and other outcomes under voting rules that do 
not require unanimity. 

Several recent surveys of the voluminous literature on the size or 
growth of government are now available (see Brunner 1978; Peacock 
1979; Aranson and Ordeshook 1980; and Larkey, Stolp, and Winer 
1980).? Many of the hypotheses advanced in this literature emphasize 
the incentives for bureaucrats, politicians, and interest groups to 
increase their incomes and power by increasing spending and the 
control of resources or rely on specific institutional details of the 
budget, taxing, and legislative processes. Although such studies con- 
tribute to an understanding of the processes by which particular 
programs are chosen, they often neglect general equilibrium aspects. 
Of particular importance is the frequent failure to close many of the 
models by balancing the budget in real terms and considering the 
effect on voters of the taxes that pay for spending and redistribution 
(see, e.g., Olson 1965; Niskanen 1971; and Hayek 1979). A recent 
empirical study by Cameron (1978) suggests that decisions about the 
size of the budget are not the result of “fiscal illusion,” so the neglect 
of budget balance cannot be dismissed readily. 

We differ from much of the recent literature in three main ways. 


1 All variables are real. There is no inflation. Budget balance means that redistribu- 
tion uses real resources. Public goods are neglected. 

? Ideally the size of government would be measured by the net burden imposed (or 
removed) by government programs. 

3 Larkey et al. (1980) include a survey of previous surveys. Recent surveys by Mueller 
(1976) and Sahota (1978) summarize recent contributions by Downs (1957), Musgrave 
(1959), Olson (1965), Niskanen (1971), Buchanan and Tullock (1972), Riker and 
Ordeshook (1973), and others to such related topics as the determination of equilib- 
rium collective decisions and the effects of government policies on the distribution of 
income. 
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First, voters do not suffer from “fiscal illusion” and are not myopic. 
They know that the government must extract resources to pay for 
redistribution. Second, we concentrate on the demand for redistri- 
bution and neglect any “public goods” provided by government (see 
also Peltzman 1979). Third, we return to the earlier tradition of de 
Tocqueville ([1835] 1965) who associated the size of government, 
measured by taxes and spending, with two factors: the spread of 
the franchise and the distribution of wealth (property).* 

Our hypothesis implies that the size of government depends on the 
relation of mean income to the income of the decisive voter. With 
universal suffrage and majority rule, the median voter is the decisive 
voter as shown by Roberts (1977) in an extension of the well-known 
work of Hotelling (1929) and Downs (1957). Studies of the distribu- 
tion of income show that the distribution is skewed to the right, so the 
mean income lies above the median income. Any voting rule that 
concentrates votes below the mean provides an incentive for redis- 
tribution of income financed by (net) taxes on incomes that are (rela- 
tively) high. Extensions of the franchise to include more voters below 
mean income increase votes for redistribution and, thus, increase this 
measure of the size of government. 

The problem with this version of the de Tocqueville hypothesis is 
that it explains too much. Nothing limits the amount of redistribution 
or prevents the decisive voter from equalizing incomes or, at a 
minimum, eliminating any difference between his disposable income 
and the disposable income of those who earn higher incomes. Incen- 
tives have been ignored. Higher taxes and redistribution reduce the 
incentive to work and thereby lower earned income. Once we take 
account of incentives, there is a limit to the size of government. To 
bring together the effect of incentives, the desire for redistribution, 
and the absence of fiscal illusion or myopia, we develop a general 
equilibrium model. 

Section II sets out a static model. Individuals who differ in produc- 
tivity, and therefore in earned income, choose their preferred combi- 
nation of consumption and leisure. Not all individuals work, but those 
who do pay a portion of their income in taxes. The choice between 
labor and leisure, and the amount of earned income and taxes, de- 
pend on the tax rate and on the size of transfer payments. 

The tax rate and the amount of income redistributed depend on 
the voting rule and the distribution of income. Section III shows how 
income redistribution, taxes, and the size of the government budget 


* We are indebted to Larkey et al. (1980) for pointing out the similarity between de 
Tocqueville and the conclusion we reached in an earlier version and in Meltzer and 
Richard (1978). De Tocqueville’s distribution of property finds an echo in the concerns 
about “mob rule” by the writers of the Constitution. 
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change with the voting rule and the distribution of productivity. A 
conclusion summarizes the findings and main implications. 


II. The Economic Environment 


The economy we consider has relatively standard features. There are 
a large number of individuals. Each treats prices, wages, and tax rates 
as givens, determined in the markets for goods and labor and by the 
political process, respectively. Differences in the choice of labor, lei- 
sure, and consumption and differences in wages arise solely because 
of differences in endowments which reflect differences in produc- 
tivity. In this section, we extend this standard model to capture the 
salient features of the process by which individuals choose to work or 
subsist on welfare payments and show the conditions under which 
these choices are uniquely determined by the tax rate. 

The utility function is assumed to be a strictly concave function, 
u(c, 1), for consumption, c, and leisure, /. Consumption and leisure 
are normal goods, and the marginal utility of consumption or lei- 
sure is infinite when the level of consumption or leisure is zero, re- 
spectively. There is no capital and no uncertainty. 

The individual’s endowment consists of ability to produce, or pro- 
ductivity, and a unit of time that he allocates to labor, n, or leisure, / = 
1 — n. Individual incomes reflect the differences in individual pro- 
ductivity and the use of a common, constant-returns-to-scale technol- 
ogy to produce consumption goods. An individual with productivity x 
earns pretax income, y: 


y(x) = xn(x). (1) 


Income is measured in units of consumption. 

Tax revenues finance lump-sum redistribution of r units of con- 
sumption per capita. Individual productivity cannot be observed di- 
rectly, so taxes are levied against earned income. The tax rate, ft, is a 
constant fraction of earned income but a declining fraction of dispos- 
able income. The fraction of income paid in taxes net of transfers, 
however, rises with income.® There is no saving; consumption equals 


° Reliance on a linear tax follows a well-established tradition. Romer (1975) analyzed 
problems of unimodality using a linear tax and predetermined government spending. 
Roberts (1977), using a linear tax and a predetermined budget, showed that the median 
voter dominates the solution if incomes are ordered by productivity. Linear tax func- 
tions are used also when the social welfare function is used to determine the optimal tax 
(see Sheshinski 1972). The degree to which actual taxes differ from linear taxes has 
generated a large literature. Pechman and Okner (1974) find that the tax rate 1s ap- 
proximately constant. King (1980) writes that most redistribution in the United States 
and the United Kingdom comes from the transfer system, not from the tax system. 
Browning and Johnson (1979) show that conclusions about proportionality of the tax 
rate depend heavily on assumptions used to allocate the burden of indirect business 


taxes. 
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disposable income as shown in (2): 
Cx) = (Lax; c=0. (2) 


If there are individuals without any ability to produce, x = 0, their 
consumption is r 2 0. 

Each individual is a price taker in the labor market, takes t and r as 
givens, and chooses n to maximize utility. The maximization problem 
iS: 


maxes w¢,l) = Smax uly me(li— 1); tea (3) 
n€[0,1] n€&[0,1] 


The first-order condition, 


Ou 
0 = — =udr + nx(1 — t), 1 — n)x(1 — t) 
ys (4) 


Ay text lt hla 10 |3 


determines the optimal labor choice, n[r, x(1 — t)], for those who 
choose to work. The choice depends only on the size of the welfare 
payment, r, and the after-tax wage, x(1 — f).® 

Some people subsist on welfare payments. From (4) we know that 
the productivity level at which n = 0 is the optimal choice is 


EPR KT al) 
oe mG Da aii 2) 


Individuals with productivity below x» subsist on welfare payments 
and choose not to work; n = 0 for x S xp. 

Increases in redistribution increase consumption. For those who 
subsist on welfare, c = r, so 0c/Or = 1. Those who work must consider 
not only the direct effect on consumption but also the effect of 
redistribution on their labor-leisure choice. The assumption that con- 
sumption is a normal good means that dc/dr > 0. Differentiating (4) 
and using the second-order condition, D < 0, in footnote 6 restricts 
Ue: 


Oc _ _Uax(1 — t) — uy 
= = > 0. (6) 


Consumption increases with r for both workers and nonworkers pro- 
vided consumption is a normal good. 

The positive response of c to r takes one step toward establishing 
conditions under which we find a unique value of r that determines 


* By assumption, w is strictly concave, so the second-order condition is negative and 
(4) defines a maximum. The second-order condition is @2u/@n? = D = u,x%(1 — t)? — 
2a len eae ee Os 
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the amount of earned income and amount of redistribution for each 
tax rate. The next step is to show that normality of consumption is 
sufficient to establish that earned income (income before taxes) in- 
creases with productivity. 

Pretax income is 


Ve, bk) — KI Y, Xa ¢) | (7) 


People who do not work, x < xo, have y = 0 and @y/dx = 0. For all 
others, 





Gen eae (8) 
The first-order condition (eq. [4]) yields 
ON a thee IX Lt) Ue Lt) (9) 
Ox al) 


The sign of @n/dx is indeterminate; as productivity increases, the 
supply of labor can be backward bending. Pretax income, y = nx, does 
not decline, however, even if n falls. Substituting (9) into (8) and 
rearranging terms shows that the bracketed term in (10) is the 
numerator of dc/dr in (6). Hence, Oy/dx is positive for all x > xp 
provided that consumption is a normal good: 


2 Eee AL Ly a eae = ey | > 0. (10) 


The final step in establishing that there is a unique equilibrium 
solution for any tax rate uses our assumption that leisure is a normal 
good. The government budget is balanced and all government 
spending is for redistribution of income. If per capita income is y, 
then 


ty =r. Ct) 


Let F(-) denote the distribution function for individual productivity, 
so that F(x) is the fraction of the population with productivity less than 
x. Per capita income is obtained by integrating: 


co 


y =| xn[r, (1 — t)x]dF (x). (12) 
zo 


Equation (12) shows that per capita income, and therefore total 
earned income, is determined once we know xp, ¢, and r. From (5), we 
know that x) depends only on¢ andr, and from (11) we know that, for 
any tax rate, there is at least one value of r that balances the budget.’ 


7 The left side of (11) is nonnegative and is a continuous function of r that is bounded 
by tx, where x is the average of x. 
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If leisure is a normal good, the value of r that satisfies (11) for each ¢ is 
unique.® 

Once r or t is chosen, the other is determined. The individual’s 
choices of consumption and the distribution of his time between labor 
and leisure are determined also. The choice of r or t uniquely deter- 
mines each individual’s welfare and sets the size of government. 


Ill. The Size of Government 


The political process determines the share of national income taxed 
and redistributed. The many ways to make this choice range from 
dictatorship to unanimous consent, and each produces a different 
outcome. We call each political process that determines the tax rate a 
voting rule. 

In this section, we consider any voting rule that allows a decisive 
individual to choose the tax rate. Two examples are dictatorship and 
universal suffrage with majority rule. A dictator is concerned about 
the effect of his decisions on the population’s decisions to work and 
consume, but he alone makes the decision about the tax rate. Under 
majority rule, the voter with median income is decisive as we show 
below. We then show that changes in the voting rules and changes in 
productivity change the tax rate and the size of government. 

The decisive voter chooses the tax rate that maximizes his utility. In 
making his choice, he is aware that his choice affects everyone’s 
decision to work and consume. Increases in the tax rate have two 
effects. Each dollar of earned income raises more revenue but earned 
income declines; everyone chooses more leisure, and more people 
choose to subsist on redistribution. “High” and “low” tax rates have 
opposite effects on the choice of labor or leisure and, therefore, on 
earned income. 

Formally, the individual is constrained to find a tax rate that bal- 
ances the government budget, equation (11), and maximizes utility 
subject to his own budget constraint, equation (3). The first-order 
condition for the decisive voter is solved to find his preferred tax rate: 


~ dy 
+ t—— — yg = 0, 
M dt Ya (13) 
where yq is the income of the decisive voter. 
* The normality of leisure means that dl/dr > 0 and, therefore, 0n/dr = —dl/ar < 
0. Since 





oy -| On 
ay ge Or dF (x) < 0, 
the left side of (11) is a strictly decreasing, continuous function of r. The right side of 
(11) strictly increases with r. This implies that there is a unique value of r that satisfies 


Gal): 
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Roberts (1977) showed that if the ordering of individual incomes is 
independent of the choice of r and ¢, individual choice of the tax rate 
is inversely ordered by income. This implies that with universal suf- 
frage the voter with median income is decisive, and the higher one’s 
income, the lower the preferred tax rate. By making the additional 
assumption that consumption is a normal good, we have shown that 
incomes are ordered by productivity for all r and t. Combining 
Roberts’s lemma | (1977, p. 334) with our results, we can order the 
choice of tax rate by the productivity of the decisive voter.? The 
higher an individual’s productivity, the lower is his preferred tax rate. 

Figure 1 illustrates the proposition and shows the effect on the tax 
rate of changing the voting rule. The negatively sloped line is the 
relation between individual productivity, x, and the individual’s pre- 
ferred tax rate. This line need not be linear. 

The maximum tax rate, tmax, is chosen if the decisive voter does not 
work. An example is x = Xq;. In this case, x S< x»; the decisive voter 
consumes only 7, so he chooses the tax rate (tma,) that maximizes r. 
Any higher tax rate reduces aggregate earned income, tax collections, 
and the amount available for redistribution. From equation (5), we see 
that the maximum tax rate must be less than t = 1. 

As productivity rises from xy to x, the tax rate declines from tmax to 
0. Atxqg = x, the decisive voter is endowed with average productivity 
and cannot gain from lump-sum redistribution, so he votes for no 
redistribution by choosing t = 0.!° From equation (5) and u,(0, -) = », 
we see that everyone works when r = 0. If the decisive voter’s produc- 
tivity exceeds x, t and r remain at zero and aggregate earned income 
remains at society's maximum. 

Changes in the voting rule that spread the franchise up or down the 
productivity distribution change the decisive voter and raise or lower 
the tax rate. Our hypothesis implies that changing the position of the 
decisive voter in the distribution of productivity changes the size of 
government provided x» < xq < x. Major changes in xg have occurred 
in two ways. Wealth and income requirements for voting were re- 
duced or eliminated, gradually broadening the franchise and lower- 
ing the income of the decisive voter. Social security retirement systems 
grew in most countries after the franchise was extended. By increas- 


® The formal statement of the result is: Consider any two pairs (7;, t;) and (79, t2). If t, 
> t,, then for all x: x is indifferent between (r,, t,) and (rg, t,) implies that x’ weakly 
prefers (72, ty) to (r1, t;) for all x’ <x and x” weakly prefers (rj, ¢;) to (r2, t2) for all x” > x; x 
strictly prefers (rj, ¢;) to (rz, t2) implies that x” strictly prefers (7,, t,) to (rz, ta) for all x” > x; 
x strictly prefers (79, ty) to (r,, t,) implies that x’ strictly prefers (rp, t2) to (71, t;) for all x’ < 
x. Note that this result does not require unimodality of voter preferences for tax rates. 

10 We have omitted public goods. In an earlier version we showed that under care- 
fully specified conditions, public goods can be included without changing the result for 


redistribution. 
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ing the number of retired persons, social security systems increase the 
number of voters who favor increased redistribution financed by 
taxes on wages. Some of the retired who favor redistribution also 
favor low taxes on capital, property, and the income from capital. 

The size of government changes also if there are changes in relative 
income, as shown by equation (13), or relative productivity. Conclu- 
sions about the precise effect of changes of this kind are difficult to 
draw. We cannot observe productivity directly and can only infer 
changes in the distribution of productivity, F(-), by observing changes 
in relative income. Recent literature makes clear that these effects are 
disputed (see Sahota 1978; King 1980; and others). Further, we 
cannot deduce the effect of changes in productivity on ¢ directly from 
equation (13). The reason is that y depends ont, so finding the effect 
of changes in relative productivity requires the solution to a nonlinear 
equation in ¢. Instead, we rewrite (13) in a form which involves the 
(partial) elasticities of per capita income (jy) with respect to redistribu- 
tion (r) and the wage rate (x[1 — ¢]). 

Let tr = 1 — t be the fraction of earned income retained. From (12), 
y depends on r and 7 only. The total derivative 


dy _ 37-3: 
wdheac! 1 — ty,’ (14) 


where y, and y, are the two partial derivatives. Substituting (14) into 
(13), we solve for t: 


We is LE IOs 2) 
m= 1+ aG.7) + mG, 7) a 
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where m is the ratio of mean income to the income of the decisive 
voter, y/yg, and the 7’s are partial elasticities. Using the common 
economic assumption that the elasticities are constant, the tax rate 
rises as mean income rises relative to the income of the decisive voter, 
and taxes fall as m falls: 


a _ GDL - 1G) 
dm [m—-1+ 79.7) + mn DP iG) 


Relaxing the assumption of constant elasticities weakens the conclu- 
sion, but we expect the sign of (16) to remain positive provided the 
change in the elasticities is small. 

One of the oldest and most frequently tested explanations of the 
growth of government is known as Wagner’s law. This law has been 
interpreted in two ways. The traditional interpretation is that gov- 
ernment is a luxury good so that there is a positive relation between 
the relative size of government and the level of real income. Recently 
Alt (1980) has questioned this interpretation of Wagner’s idea. Alt 
(1980, p. 4) notes that Wagner argued that there is “a proportion 
between public expenditure and national income which may not be 
permanently overstepped.” This suggests an equilibrium relative size 
of government rather than an ever-growing government sector. 

The traditional statement of Wagner’s law—that government 
grows more rapidly than income—has been tested many times, but 
with mixed results. Peacock and Wiseman (1961), Cameron (1978), 
and Larkey et al. (1980) discuss these tests. Our hypothesis suggests 
that the results are ambiguous because Wagner’s law is incomplete. 
The effect of absolute income on the size of government is condi- 
tional on relative income. Average or absolute income affects the 
elasticities in equation (15), and the relative income effect is given by 
m. 

To make our hypothesis testable, we must identify the decisive 
voter. The applicable voting rule in the United States is universal 
franchise and majority rule. Under this rule, the voter with median 
income is decisive in single-issue elections, as we argued above. Hence 
the median voter is decisive in elections to choose the tax rate, so m is 
the ratio of mean to median income." 


‘The many tests of the median-voter hypothesis using regression analyses are 
inconclusive. One reason is that many of the tests do not discriminate between the 
median and any other fractile of the income distribution (see Romer and Rosenthal 
1979). Cooter and Helpman (1974) use income before and after taxes net of transfers 
to estimate the shape of the social welfare function implicit in U.S. data. They conclude 
that “the assumption that ability is distributed as wages per hour . .. —perhaps the best 
assumption on distribution of ability—vindicates the median voter rule.” 
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IV. Conclusion 


Government spending and taxes have grown relative to output in 
most countries with elected governments for the past 30 years or 
longer. Increased relative size of government appears to be inde- 
pendent of budget and tax systems, federal or national governments, 
the size of the bureaucracy, and other frequently mentioned institu- 
tional arrangements, although the relative rates of change in different 
countries may depend on these arrangements. 

Our explanation of the size of government emphasizes voter de- 
mand for redistribution. Using a parsimonious, general equilibrium 
model in which the only government activities are redistribution and 
taxation, the real budget is balanced, and voters are fully informed, 
we show that the size of government is determined by the welfare- 
maximizing choice of a decisive individual. The decisive individual 
may be a dictator, absolute monarch, or marginal member of a junta. 

With majority rule the voter with median income among the en- 
franchised citizens is decisive. Voters with income below the income of 
the decisive voter choose candidates who favor higher taxes and more 
redistribution; voters with income above the decisive voter desire 
lower taxes and less redistribution. The decisive voter chooses the tax 
share. When the mean income rises relative to the income of the 
decisive voter, taxes rise, and vice versa. The spread of the franchise 
in the nineteenth and twentieth centuries increased the number of 
voters with relatively low income. The position of the decisive voter 
shifted down the distribution of income, so tax rates rose. In recent 
years, the proportion of voters receiving social security has increased, 
raising the number of voters favoring taxes on wage and salary in- 
come to finance redistribution. A rational social security recipient with 
large property income supports taxes on labor income to finance 
redistribution but opposes taxes on income from property. In our 
analysis, there is neither capital nor taxes on property, so the increase 
in social security recipients has an effect similar to an extension of the 
franchise. 

Our assumption that voters are fully informed about the size of 
government differs from much recent literature. There, taxpayers 
are portrayed as the prey sought by many predators who conspire to 
raise taxes relative to income by diffusing costs and concentrating 
benefits, or in other ways (Buchanan and Tullock 1962; Olson 1965; 
Niskanen 1971; Hayek 1979). We acknowledge that voters are ill 
informed about the costs of particular projects when, as is often the 
case, it is rational to avoid learning details. Knowledge of detail is not 
required to learn that the size of government has increased and that 
taxes have increased relative to output or income. Long ago it became 
rational for voters to anticipate this outcome of the political process. 
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Wagner’s law, relating taxation to income, has generated a large 
literature and has been tested in various ways. Our analysis shows that 
Wagner’s law should be amended to include the effect of relative 
income in addition to absolute income. 

Kuznets (1955) observed that economic growth raises the incomes of 
skilled individuals relative to the incomes of the unskilled. In this way, 
economic growth can lead to rising inequality and, if our hypothesis is 
correct, to votes for redistribution. The rising relative size of govern- 
ment slows when the relative changes come to an end and reverses if 
the relative changes reverse in a mature stationary economy. 

The distinctive feature of our analysis is not the voting rule but the 
relation between individual and collective choice. Each person 
chooses consumption and leisure by maximizing in the usual way. 
Anyone who works receives a wage equal to his marginal product. 
Taxes on labor income provide revenues for redistribution, however, 
so everyone benefits from decisions to work and incurs a cost when 
leisure increases. 

The analysis explains why the size of government and the tax rate 
can remain constant yet be criticized by an overwhelming majority of 
citizens. The reason is that at the voting equilibrium nearly everyone 
prefers a different outcome. If unconstrained by the voting rule, 
everyone but the decisive voter would choose a different outcome. 
But only the decisive voter can assure a majority. 

An extension of our argument may suggest why real government 
debt per capita, as measured in the budget, has increased more than 
20-fold in this century. The decisive voter has as much incentive to tax 
the future rich as the current rich. An optimal distribution of the cost 
of redistribution would not tax only the current generation because, 
with economic growth, the future generation will be richer than the 
current generation. By shifting the burden of taxation toward the 
future, income is redistributed intertemporally. 

To pursue these questions more fully and to analyze any effect of 
defense and public goods, it seems necessary to embed the analysis in 
a model with saving, capital accumulation, and public goods and to 
explore the effect of permitting relative shares to change as income 
changes. From an analysis of a growing economy, we can expect to 
develop a rational theory of the growth of government to comple- 
ment our analysis of the government’s size. 
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Intergenerational Transmission of Inequality 
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Unequal inheritance of material wealth is commonly considered 
a major cause of inequality in consumption. However, theoreti- 
cal models of the intergenerational transmission of inequality by 
Becker, Blinder, and Ishikawa imply that unequal inheritance may 
either increase or reduce consumption inequality. Differences in 
inherited wealth resulting from unequal parental incomes increase 
inequality in recipients’ consumption. However, unequal bequests 
caused by differences among families in the endowed ability of 
children or the costs of producing human capital are equalizing. 
Empirical results confirm these predictions: The inheritance re- 
ceived by children is inversely related to both children’s income and 
parental education. Thus bequests are “compensatory” in that 
(ceteris paribus) low-income children inherit more than their ad- 
vantaged contemporaries. 


The ultimate difficulties . . . center around the problem 
of social continuity in a world where individuals are 
born naked, destitute, helpless, ignorant and untrained. 
... The fundamental fact about society . . . is that it is 
made up of individuals who are born and die and give 


An earlier version of this paper, “Inheritance and the Intergenerational Transmis- 
sion of Inequality: Theory and Empirical Results,” was presented in workshops at the 
University of Chicago, Queen’s University, and the University of Western Ontario. 
Workshop participants provided helpful comments. Dorothy Worth provided pro- 
gramming assistance.’ I would like to thank Marvin B. Sussman and Judith N. Cates for 
permission to use their data. John A. Brittain provided a copy of the data. The referee 
and an editor provided constructive comments. Responsibility for any errors is my own. 


[Journal of Political Economy, 1981, vol. 89, no. 5] 
© 1981 by The University of Chicago. 0022-3808/81/8905-0002$01.50 


928 


TRANSMISSION OF INEQUALITY 929 


place to others; and . . . that it is dependent upon the 
utilization of three great accumulating funds of inheri- 
tance from the past, material goods and appliances, 
knowledge and skill, and morale [which] . . . must in 
some manner be carried forward to new individuals 
devoid of all these things as older individuals pass out. 
. .. [T]he institutions of the private family and private 
property, inheritance and bequest and parental respon- 
sibility, afford one way for securing more or less tolera- 
ble results in grappling with this problem. [KNIGHT 
1921, pp. 374-75] 


I. Introduction 


Inequality in economic status and the reproduction of inequality in 
succeeding generations—“social mobility”—continue to be objects of 
social concern and the subject of investigation by social scientists. It 
has long been suspected that the institution of the family plays a major 
role in the generation and transmission of inequality and may severely 
limit the ability of public policy to alter the existing social order 
(Knight 1921). 

The role of the family has been examined in a large number of 
empirical studies conducted within the human capital framework 
(e.g., Parsons 1975a). In contrast to this literature, there exists a 
research tradition which has examined the role of inherited material 
wealth in producing and perpetuating inequality (Stamp 1926; 
Wedgwood 1929; Blinder 1973, 1976a; Harbury and Hitchens 1979; 
Menchik 1979). 

Empirical research on the intergenerational transmission of in- 
equality has, for the most part, been conducted within the context of 
these two mutually exclusive approaches. However, models of family 
behavior recently proposed by Becker (1974), Ishikawa (1975), and 
Blinder (19764) strongly suggest the possibility of substitution be- 
tween alternative modes of transfer and their dependence on a com- 
mon set of ability and opportunity variables. Thus, the “ability” of 
children and the education of parents—important determinants of 
children’s schooling—should also influence parent-child transfers 
of material wealth. 

Recent theoretical models suggest that the family may not contribute 
as much to inequality as is commonly thought. Since parents are 
predicted to adjust the magnitude of intergenerational transfers in 
response to differences in the endowments of children, the institution 
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of the family permits a redistribution of resources which tends to 
equalize consumption, both between parents and children (Stuglitz 
1978; Becker and Tomes 1979)! and among disadvantaged and ad- 
vantaged siblings (Becker and Tomes 1976; Griliches 1979). Because 
of the equalizing role of the family, government redistribution mea- 
sures may have little, zero, or even a perverse effect on the distribu- 
tion of income and consumption (Barro 1974; Becker and Tomes 
1979); 

The purpose of this paper is to empirically test the altruism model 
of the intergenerational transmission of inequality proposed by 
Becker, Ishikawa, and Blinder. In this model, parental bequests of 
material wealth and human capital investments represent substitute 
forms of intergenerational transfer and are simultaneously deter- 
mined. The present paper extends previous work in a number of 
directions. First, previous studies of inheritance (Menchik 1979; 
Adams 1980) have been concerned with estimating the income elas- 
ticity of bequests*—and hence the extent to which unequal parental 
incomes are responsible for inequality in subsequent generations. In 
contrast, this paper emphasizes the response of parental bequests to 
variations in children’s incomes resulting from differences in child 
ability or in the ability of parents to produce human capital in their 
children. Since intergenerational transfers serve in part to redistrib- 
ute resources among family members, when parental income is held 
constant, the amount inherited is predicted to be inversely related to 
the recipient’s income. Thus, within parental income strata, bequests 
of material wealth are predicted to be equalizing. Further, if more 
educated parents are more efficient at producing learning or earning 
skills in their children, the progeny of more educated parents will 
receive higher lifetime incomes from human capital. As a result, the 
equalizing role of the family will also be reflected in an inverse re- 
lationship between inherited wealth and parental education. 

Second, the choice of family size, a variable emphasized in simula- 
tion models (Pryor 1973) and empirical studies (Lebergott 1976; 


‘In the model of Becker and Tomes (1979), it can be shown that, if the marginal 
propensity of parents to transfer resources to children exceeds the inheritability of 
endowments, the relative inequality in the steady-state distribution of income (as mea- 
sured by the coefficient of variation) is less when parents choose the optimum level of 
transfers than if there were no intergenerational transfers.. 

* A limitation of these studies is that estimates of the income (wealth) elasticity of 
bequests do not distinguish between alternative models of bequest behavior. Both 
altruistic models and models of “egoistic” life-cycle accumulation in the presence of 
uncertain lifetimes and imperfect capital (annuity) markets are consistent with a high 
income elasticity of bequests (see Tomes 1979), 
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Blomquist 1979), is treated as endogenous,’ as suggested by the “new 
microeconomic” approach to fertility. 

Third, the observation that 55—60 percent of individuals do not 
receive an inheritance of material wealth (Blinder 1973; Blomquist 
1979) suggests that there are two categories of households: those 
making transfers exclusively in the form of human capital who are at 
a corner solution making no transfers of material wealth; and second, 
households making transfers in both forms. These two groups (re- 
gimes) are predicted to respond differently to variations in parental 
income and education (Ishikawa 1975; Blinder 1976); Edwards and 
Grossman 1977). In the absence of data on material wealth transfers, 
previous investigators (Edwards and Grossman 1977; Leibowitz and 
Friedman 1979) were restricted to indirect tests of this model. The 
present paper reports direct empirical tests of the two-regime model. 

Empirical tests of the model are performed using a unique data set 
which contains information on the inheritance, income, and schooling 
of children and the occupation, education, and family size of parents. 

The empirical results offer strong support for the Becker, Blinder, 
and Ishikawa models of family intergenerational transfers. The most 
striking empirical finding is that the inheritance received by children 
is inversely related to the recipient’s income. This result supports the 
prediction that, within parental income strata, inheritance is 
equalizing—low-income children are “compensated” in the form of 
larger bequests, and high-income children are “taxed” in that they 
receive smaller bequests. Alternative explanations for the inverse 
relationship between inheritance and recipient’s income are that it is 
the result of (i) the work-disincentive effect of nonlabor income on 
earnings, (ii) the payment by donors for greater old-age care ren- 
dered by low-income children, or (iii) the reallocation among siblings 
of shares in a given total bequest. These alternative explanations are 
not supported by the data, leading us to accept the hypothesis implied 
by the Becker, Blinder, and Ishikawa models: differences in inherited 
material wealth perform a role of equalizing the distribution of re- 
sources between successive generations of the family. 

The plan of the remainder of the paper is as follows: Section I 
briefly outlines a model of the simultaneous determination of inheri- 
tance, human capital investment, and family size which captures the 
essential features of the Becker, Blinder, and Ishikawa models of 


3 Blinder suggests that “the taste for bequests . . . would rise with the number of 
children” (1976, p. 90). Easterlin (1976) presents a “bequest” model of fertility in 
which parents choose the number of children consistent with achieving a given target 
bequest to each child. In the model outlined below, the bequest per child is simultane- 
ously determined together with parental fertility. 
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intergenerational transfer behavior by the altruistic family. In Section 
III the data are described, and the model is specified empirically. 
Empirical results are reported in Section IV. Finally, Section V pre- 
sents a summary and conclusions. 


Il. A Model of Inheritance, Human Capital Investment, 
and Family Size 


Let us abstract from uncertainty and life-cycle factors* and consider a 
static, certain world in which individuals, endowed with perfect 
foresight, live for two periods—initially as a “child” and subsequently 
as a “parent.” Generations of the family are assumed to be interde- 
pendent and overlapping. Parental fertility and the transmission of 
resources to children are assumed to be motivated by altruism, in that 
the number of children and their per capita consumption enter the 
parent’s utility function together with the parent’s own consumption. 
This utility function may be written as 


Ul =U WN, es 25) (1) 


where N is the number of children and z, is the per capita consump- 
tion of children during their subsequent tenure as parents. For 
simplicity, per capita consumption is assumed to be the same for all N 
children within a given family. The quantity z, represents the aggre- 
gate consumption of parents.° 

This formulation captures the essence of the Becker, Blinder, and 
Ishikawa models® in which parents derive satisfaction from a gross 
measure of the well-being of children—their lifetime utility, income, 
or consumption. The donor’s utility does not depend on the composi- 
tion of recipient’s consumption (i.e., there are no “merit” goods), on 
direct services (e.g., old-age care), or on the sources of recipient’s 
lifetime income—the division between human wealth and financial 
wealth. 

Individuals are assumed to enter the world with an endowment of 
income determined by such factors as genetic and cultural inheri- 
tance, government fiscal and social policy, exogenous economic 


“This assumption rules out “contingent bequests” when life-cycle savings for future 
consumption become bequests due to the uncertain incidence of mortality. Bevan and 
Stiglitz (1978) and Tomes (1979) discuss such models. In the present model, all be- 
quests are planned and are motivated by parental altruism. 

° All households are assumed to be composed of two parents plus children. I do not 
distinguish between the consumption of individual parents. 

® The differences in the formations of Becker, Ishikawa, and Blinder may be of 
importance for some issues, such as the wealth effects of bond-financed fiscal policy. Of 
greater importance in the present context are the aspects that these theoretical models 
have in common. 
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growth, and random “luck.” For simplicity, I assume that all children 
within a given family possess equal endowments. Thus, endowed 
income may vary across families but not within families.” 

Motivated by their altruism, parents may augment the consumption 
of their children either by investing in their human capital or via 
direct transfers of material wealth.* The relationship between the 
child’s consumption and parent-child transfers is determined by the 
income-expenditure constraint of the child generation and the tech- 
nology described by the income-generating function, 


Z.=1,=e+ Bx "ey +a=e+kr+a, (2) 


where k = Bx'"e” is the constant elasticity human capital production 
function, 0 < y, n < 1. 

The left-hand side of (2) represents the child’s income-expenditure 
constraint which indicates that, in the absence of transfers by the child 
to subsequent generations, consumption equals income.® The right- 
hand side of (2) defines income in terms of its proximate determi- 
nants: e represents the endowed income of the child; k, income from 
human capital; and a, the child’s income from material wealth.’ 
Financial asset markets are assumed perfect, so that the rate of return 
on material wealth is constant and independent of the individual’s 
stocks of human and nonhuman capital. 

In general, both market-purchased goods (e.g., tuition) and 
home-produced inputs (e.g., the time of parents) enter the produc- 
tion function for human capital. For simplicity, we assume that 
market-purchased goods and home-produced inputs are perfect sub- 
stitutes, so that a single aggregate input (x) enters the production 
function. Investment in the human capital of a given individual is 
assumed subject to diminishing returns with respect tox (i.e., 1 > ) > 
0). The return to (home and market) inputs in the production of 
children’s earning skills is also permitted to depend on the child’s 


7 Becker and Tomes (1976) consider differences in transfers to children within the 
same family when children differ in endowed ability. Evidence relating to intrafamily 
inequality in bequests of financial wealth is presented in Sec. IV. 

8 We assume that all parents are at the interior solution where net parent-child 
transfers are positive—i.e., altruistically motivated transfers are nonzero. Even with the 
secular growth of incomes over the generations, regression toward the mean in en- 
dowed income and reverse (child-parent) transfers effected by bond-financed fiscal 
policies and social security programs would tend to make positive private parent-child 
transfers optimal for at least some households. Given the observed prevalence of 
“forward” transfers in forms such as schooling, this assumption seems plausible. 

® Becker and Tomes (1979) consider intergenerational transfers in the context of 
many generations. 

10 For simplicity, stocks of both human capital and material wealth are measured in 
units of the child’s income. Note also that in general e would represent a vector of 
earning and learning abilities. 
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characteristics—in the form of the child’s endowments. To the extent 
that the child’s endowment represents the ability to learn, greater 
child endowments will be associated with greater income for given 
inputs (i.e., y > 0). 

Families are permitted to differ in their ability to produce income- 
earning skills in their children (Leibowitz 1974). The parameter B in 
equation (2) represents the level of parental efficiency, which is as- 
sumed to operate in a manner equivalent to technological progress, 
increasing the next generation’s income for given inputs and child 
endowments. 

In the absence of intrafamily public goods and fixed cost compo- 
nents, and given identical children, the full income constraint of 
parents can be written as’ 


Ip =Zp + pNx + paNa, (3) 


where parental consumption (z,) is the numeraire, Ip is the full in- 
come of parents, p is the unit price of human capital inputs, pq is the 
cost of increasing the asset income of each child by $1.00,’? and pNx 
and p,Na represent (respectively) the total expenditures by parents on 
the human capital and material wealth of the next generation. 


The Commodity Demand Functions 


The allocation problem of parents is to maximize utility (1) subject to 
(2) and (3). The first-order conditions for an interior solution are: 


Up 








v = | = ia, (4) 
Baseball PN | sanieeg ee 
Y Bee — ay Pee ra 
U 1/(1—n) 
Vom tbanelge) + Pann 6) 


where YW is the marginal utility of parental income, U; is the marginal 
utility of consumption by the 7th generation, and Uy is the marginal 
utility of family size. 

The marginal rate of substitution between each commodity and 
parental income is equated to the marginal cost defined by the first- 
order conditions (7; for the jth commodity). The interaction of 


é “ Since all children have equal endowments and the same income-generating func- 
tion (2), human capital investment and material wealth transfers will be equal for all 
children within a given family. 

2 Inthe absence of estate and income taxes, pa = (1 + r)~1, where r equals the 
intergenerational rate of return on assets. 
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“quality and quantity” dimensions of choice is reflected in the fact that 
the marginal costs of heir’s consumption and family size (77, and 7y, 
respectively) depend not only on prices but also on the level of 
commodity outputs. Because of this interdependence, we predict an 
inverse relationship between family size and parent-child transfers. 

The optimum scale and mode of intergenerational transfers are 
determined by the first-order conditions (4) and (5). We assume that 
the (increasing) marginal cost of human capital investment is initially 
below the (constant) marginal cost of asset transfers. If loans from 
outside the family or intrafamily loans from parents are available, all 
households would invest in human capital up to the point where the 
marginal costs of both forms of transfer were equalized. Under these 
circumstances, human capital investment would be independent of 
the “opportunities” defined by parental income and family size and 
would be determined solely by parental efficiency and the child’s 
endowed ability. 

Observation suggests that such equality of opportunity among 
households does not exist as regards human capital investment be- 
cause of the absence of collateral and the high cost of enforcing 
implicit or explicit loan contracts. In the present model, this imper- 
fection in the capital market is modeled in an extreme form by 
assuming that all human capital investment is financed internally 
within the family and neither intergenerational loans nor bequests of 
debt are permitted (i.e., a 2 0). Under this financial liability con- 
straint, all human capital investment constitutes an intergenerational 
transfer. In this context there will exist two categories of families— 
those making transfers on a scale sufficient to make positive asset 
transfers for whom the financial liability constraint is ineffective, and 
those households who would like to transfer debt to the next genera- 
tion but for whom the financial liability constraint is binding. 

For households for whom the financial liability constraint (a = 0) is 
binding, the first-order conditions (5) and (6) require modification to 
become 








Ls -on N a ' 
Paaiesase’ eae hea es) 
U 7 - k 1/1-—) = P 
= = px = o(gr] = Ty. (6') 


This latter category of households will make all transfers in the 
form of human capital, and material wealth transfers will be zero. 
Since, for these households, marginal transfers are in the form of 
human capital investment, the level of such transfers will depend on 
parental income and family size—determinants of opportunities—in 
addition to parental efficiency and child endowments. 
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For both categories of households, the solutions to the appropriate 
first-order conditions can be represented as a set of demand functions 
for family size and the consumption of both generations in terms of 
“family resources” and commodity marginal costs (7's) (Becker and 
Lewis 1973). The demand for the marginal form of transfer repre- 
sents an excess demand function for children’s consumption. 

If the parental utility function (1) is homothetic CES with equal 
elasticities of substitution between all arguments, and if we assume 
that the next generation’s consumption (in present value terms) rep- 
resents no more than half of family resources,” the following sign 
predictions are obtained: 


For a > 0: 
eo 2 So © 
Ce Be eA (7a) 
(CoC mC) 
N= o[l;, 0. & al. (7b) 
Homas—s0: 
ey 2a © 
Pa > 0)= OL; B; k, N], (8a) 
Cy come) 
N = ply, Bs k], (8b) 


where equation (8a) predicts the probability of positive material 
wealth transfers. 

The major predictions of the model can now be summarized.'* 
When asset transfers are positive, greater parental income is pre- 
dicted to increase both asset transfers and family size. Since the 
demand for material wealth transfers represents an excess demand 
for child consumption, the ratio of asset transfers to family size will 
increase with parental income, and further, the income elasticity of 
material wealth transfers is predicted to be decreasing (Becker and 
Tomes 1976). The inverse relationship between quality and quantity 
dimensions of choice is reflected in the negative signs on family size 
and asset transfers in equations (7a)—(7b). 

Material wealth transfers are predicted to be inversely related to 
children’s income from human capital, reflecting the substitution 
between the two modes of transfer. When parental efficiency is held 


'° This latter condition is sufficient to sign some of the comparative static results. This 
condition will hold if the rate of time/generation preference—reflecting both “impa- 
tience” and the fact that consumption on the part of one’s heirs is not the same as own 
consumption—exceeds the rate of return net of the (endogenous) rate of population 
growth, i.e., 5 > r — g > 0, where cae 

‘The comparative statics and predictions of the model are discussed in detail in an 
earlier version of this paper. See also Tomes 1981a, 19810. 
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constant, variations in human capital reflect differences in child en- 
dowments. An increase in child endowments raises the next genera- 
tion’s income for given parental expenditures, which increases family 
resources and therefore the consumption of both generations. The 
increase in parental consumption is financed by reduced parental 
expenditures on children. Since expenditures on children’s human 
capital are an increasing function of child endowments, this requires 
a more than offsetting reduction in material wealth transfers.'’ In this 
way, the reduction in intergenerational asset transfers redistributes 
resources away from children with high endowment incomes and 
reduces the inequality in consumption between members of the next 
generation, as compared with the case when transfers are inde- 
pendent of endowments. 

Equations (7a) and (8a) indicate that the effect of parental efficiency 
on both the magnitude and probability of receiving positive material 
wealth transfers is ambiguous when the recipient’s human capital is 
held constant. However, when the scale of human capital investment 
is permitted to adjust (i.e., k is not held constant), greater parental 
efficiency is predicted to reduce both the magnitude and probability 
of receiving asset transfers.'® Greater parental efficiency in the pro- 
duction of children’s human capital reduces the cost of this mode of 
transfer and increases family resources. This produces both a sub- 
stitution effect, increasing human capital investment and reducing 
material wealth transfers, and an income effect, increasing parental 
consumption, financed by reduced asset transfers. 

A second set of predictions concerns the determinants of human 
capital investment in the two regimes—families making positive and 
zero material wealth transfers. Assuming all variables are measured in 


‘5 Tn the simple case in which y = 7, i.e., the human capital production function (2) is 
CRTS inx and, the elasticity of material wealth transfers with respect to human capital 
transfers: k (holding £, J,, and N constant) is given by: A (d In a/d In k) = —(b(1 — b,) + 
Dele | tie Nee n)a]}), where A = b,(1 — b,a) > 0, the bs are expenditure shares in 
family resources S = Ip + mI: bp = TerZelS, bq = 7.4/8, be = 7-e/S, and b, = pNx/(1 — 
n)S. Since 0 < 1 — n < 1 and the elasticity of substitution 7 < 1 (by the second-order 
conditions), a sufficient condition for the above expression to be negative is that the 
share of children’s consumption (in present value terms) in family resources, b., not 
exceed '. 

16 In the case considered in the previous footnote, the elasticity of material wealth 
transfers with respect to parental efficiency 8, when human capital transfers are 
permitted to vary (holding J,, e, and N constant) is given by: A(d In ald \n B) = 
—b,n~!{1 — b.[1 + (1 — n)o]}. This expression is negative if 6. < 14. Further, it can be 
shown that in the reduced-form equation in which N is permitted to vary (obtained by 
substituting eq. [9a] into [7a]-[7b] and solving for a), greater parental efficiency re- 
duces asset transfers, provided that the returns to parental expenditures on human 
capital do not exceed 12 (Tomes 1981a). Heckman’s estimates of the returns to scale in 
the production of human capital do not differ significantly from 2 (Heckman 1976a, 
table 3A). 
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logs and given the assumptions concerning technology and the utility 
function, these predictions can be summarized as: 


For a > 0: 
(=0) (4+) (+) (=0) 
k=7¥o+ Ylpt Yoe + y28 + yaN. (9a) 
For a = 0: 
(+) (?) (Ge) ©) 
k = po+ Pilp + pre + psB + psN, (9b) 
Pi>y¥i=90, Yeo> Po Ys> Ps, Pax Ya= 0. (9c) 


Since the financial liability constraint is not binding on families 
making positive asset transfers, the scale of human capital investment 
is independent of opportunity variables—implying the zero restric- 
tions in (9a). For households at a corner solution making zero material 
wealth transfers on whom the financial constraint is binding, marginal 
transfers—in human form—depend both on parental income and 
family size. 

An additional testable implication is that human capital investment 
is more responsive to parental efficiency among households making 
positive asset transfers. This follows from the prediction that, for 
households making positive asset transfers, the substitution effect on 
human capital transfers resulting from greater parental efficiency 
exceeds the sum of income and substitution effects experienced by 
families making no material wealth transfers (see Edwards and 
Grossman 1977; Tomes 1978). The predictions summarized in 
equations (7)—(9) are subject to empirical test in Section IV. 


III. Empirical Specification 


When we move from the theory to the empirical specification of the 
model, two points require discussion. First, the theoretical model 
assumed both perfect foresight and certainty. Under these conditions, 
individuals face a single allocation problem at the beginning of their 
parent status and have neither the desire nor the need to revise their 
decisions in the course of their lifetimes. However, in the real world 
both certainty and perfect foresight are lacking. Therefore, based on 
their future expectations, parents determine their fertility, the level of 
human capital investment, and material wealth transfers, with the 
possibility of revision if realized outcomes diverge from their expec- 
tations. Given that in the typical family life cycle completed fertility is 
determined prior to the completion of investment in the human 
capital of children and that material wealth transfers may occur at any 
time until the decease of the parents, these decisions are subject to 
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revision to differing degrees. In addition, human capital investment 
may be completed before the child’s income-earning ability becomes 
known to either generation, since that ability may be subject to 
stochastic influences such as the occurrence of ill health and disability. 
Given that the parents’ fertility is predetermined and given the pre- 
commitment of human capital investment, no adjustment of these 
choice variables may be possible or desirable. However, the level of 
material wealth transfers may be adjusted in the light of such events. 
In the empirical specification of the model I view the dependent 
variables as jointly determined. However, uncertainty and the tem- 
poral ordering of events suggest the introduction of variables which 
measure the realization of events over the life cycle. 

Second, the theory derives predictions concerning the total trans- 
fers of material wealth to the next generation which may take the 
form of gifts during the parental lifetime or bequests at the decease 
of parents. Data on gifts are notoriously difficult to obtain, and the 
data set I use does not contain this information. However, the omis- 
sion of information on gifts presents less of a problem for the present 
study which employs data on small wealth leavers. In these data there 
is little tax incentive in favor of gifts over bequests.’” In addition, the 
prospect of unanticipated changes in the lifetime incomes of children 
and parents creates an incentive to defer material wealth transfers 
until the parents’ decease. The inheritance received by the child is, 
therefore, used as a measure of material wealth transfers. 


The Data 


The data used in this study derive from a 5 percent random sample of 
estates probated in the Cleveland, Ohio, area in 1964-65, yielding 
659 estates (Sussman, Cates, and Smith 1970). Surviving kin and 
other heirs were interviewed, including 657 sons and daughters of the 
decedents. Information was obtained on the total estate, usual occu- 
pation, education, and other characteristics of the deceased and on 
the inheritance, income, education, and other characteristics of the 
surviving kin. The principal variables are briefly described here and 
defined, together with other variables, in table 1. 


17 The mean gross estate of decedents was $12,000, and less than 5 percent of 
decedents’ estates in this sample exceeded $60,000 after deductions and were liable for 
Federal Estate Tax (Sussman et al. 1970, p. 188). Ohio had (in 1964—65) an inheritance 
tax which commenced at 1 percent on the inheritance of children in excess of 
$7,000—almost double the mean inheritance in the data I use. Information gleaned 
from Barlow, Brazer, and Morgan (1966) suggests that the omission of data on gifts 
may not be a serious problem in terms of omitted variable bias and that the income 
elasticity of gifts exceeds that of bequests (for details, see the earlier version of this 
paper). Menchik’s study of large Connecticut estates supports this latter conclusion 
(Menchik 1979, table 4). 
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The inheritance received by the child is used as a measure of 
material wealth transfers. Theory suggests that inheritances will be 
zero for families where financial constraints are binding. In accor- 
dance with this prediction, the observed distribution of inheritances 1s 
truncated—41.5 percent of children (251 cases) received the lower- 
limit value of $250'* from the decedent’s estate. The natural spec- 
ification of the inheritance equation for the complete sample of 
children is a Tobit scheme: 














a= BX =u it BX =e 200, (10) 
a= .250 Tie (OX 1.250); 
TABLE 1 
DEFINITION OF VARIABLES 
SAMPLE OF ALL CHILDREN 
Theoretical 
Variable Empirical Measure Mnemonic 
Material wealth Inheritance received by son/daughter INHR 
transfers ($000s)* 
Human capital Annual family income of son/daughter INCOMER*¢ 
investment (000s)+ 
Years of schooling of son/daughter SCHR 
Family size Total number of kin of decedent TNKIN 
Parental income Constructed measure of decedent’s “perma- | INCOMED* 
nent” income (000s)§ 
Parental Years of schooling of decedent SCHD 
efficiency 
Additional Race of decedent (white = 1, nonwhite = 0) RACE 
parent Sex of decedent (female = 1, male = 0) SEXD 
characteristics Age of decedent AGED 
Age of decedent squared AGEDSQ 
Interaction between age and schooling of AGED - SCHD 
decedent 
Marital status of decedent at death: married MARRIEDD 
(omitted category: widowed, divorced, or 
single) 
Religion of decedent: Jewish, Protestant, JEWD 
Catholic (omitted category: no religion, { PROT 
Eastern Orthodox) CATHD 
Origin of decedent: birthplace in U.S., peereae 
Western Europe (omitted category: birth- LORIGINWE 
place Eastern Europe or other) 
Number of kin of decedent, other than chil-- OTHERKIN 


dren 





'8 Since the original data on inheritances are coded in $1,000 intervals, we were 


unable to distinguish between small inheritances (< $500) and zero inheritances (see 
table 1, §). In the discussion below we use the term “zero inheritance” to refer to 
children receiving an inheritance equal to the lower-limit value coded as $250 and 
“positive inheritance” to refer to an inheritance in excess of $250. 
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TABLE 1 (Continued) 











SAMPLE OF ALL CHILDREN 








Theoretical 
Variable Empirical Measure Mnemonic 
Additional Sex of recipient (female = 1, male = 0) SEXR 
recipient Age of recipient AGER 
characteristics Age of recipient squared AGERSQ 


Marital status of recipient: single, divorced, f SINGLER 
or separated (omitted category: married) | DIV/SEPR 
Number of household members employed | NEMPLR 
Dummy variable for the presence of non- NLABINC 
labor income (1: positive nonlabor in- 
come; 0: zero nonlabor income) 
Sex of breadwinner in recipient’s household SEX 
Frequency of contact between the recipient VISITS 
and decedent! 





*In the original data the child’s inheritance is coded in $000 intervals. An inheritance of less than $500 is 
therefore coded as $250 (i.e., .250), following Brittain (1978, p. 42). Estimates of the model using a limit value of zero 
were virtually identical with those reported. 

+ The reported monthly family income, which was originally coded in categories, was recoded on an annual basis, 
using interval midpoints for the closed intervals and using the (estimated) mean of a Pareto distribution for the 
open-ended interval. 

+ * designates a variable entered in natural logarithm. 

§ The income of recipients INCOMER (for those reporting a positive family income) was regressed on a set of 
“permanent” characteristics of recipients. The estimated coefficients were (R? = .216, N = 608): 


INCOMER® = 3.915 — .325SCHR + .01IAGER - SCHR — .104AGER 
+ .0002AGERSQ + .2660CCR + .302NEMPLR — .093SEX 
+ .627RACE + .494ORIGINWE + .658ORIGINUS + .3200RIGINEE 
+ .668PROT + .539CATH + 1.278JEW, 


where ORIGINEE = 1 if recipient's origin is Eastern Europe and 0 otherwise; OCCR is a seven-category occupation 
of breadwinner code (coded from 1: unskilled; to 7: executive). These coefficients, together with the corresponding 
characteristics of the decedent, were used to construct the income variable INCOMED (NEMPL was set equal to 


unity). 
| The original data on the frequency of contact were coded in categories: daily, weekly, monthly, less often, varies. 
These categories were recoded in terms of the number of visits per year (+ 100). The category “less often” was coded 


as .03, i.e., three visits per year. The category “varies” and missing values were coded at the mean of the nonmissing 
observations. 


where X is the vector of right-hand-side variables and w is assumed 
normally distributed, independent of X, with zero expectation and 
variance o”. In addition, we also present estimates of the determi- 
nants of the probability that an heir will inherit an amount in excess of 
the lower-limit value. 

The level of schooling is the most frequently used measure of 
human capital investinent. However, this variable corresponds to an 
input measure of human capital investment rather than the output 
measure suggested by the theory, which is the income of the child 
attributable to parental transfers. Also, schooling represents a deci- 
sion variable determined early in the life cycle, whereas income repre- 
sents the return to skills and other sources of income realized later in 
the life cycle. Life-cycle considerations suggest that family size may 
depend on children’s schooling, whereas the inheritance received by 
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the child may be more closely related to the child’s realized income 
than the child’s schooling. The empirical model incorporates these 
considerations and includes an equation relating the recipient’s in- 
come to schooling and other characteristics. 

The reported total number of kin of the decedent was used as a 
measure of family size. Since this measure includes kin in addition to 
children, the number of these more distant kin was included as a 
control variable. 

No direct measure of the annual or lifetime income of the decedent 
is available. However, information on the “permanent characteristics” 
of the decedent is available. Given supplementary data on income and 
the same set of characteristics from another source, one can construct 
a predicted income variable for decedents. The subsample of the 
surviving kin of these estates was used for this purpose. First, the log 
of recipient’s family income was regressed on his permanent charac- 
teristics; then the estimated coefficients, together with the corre- 
sponding permanent characteristics of the decedent, were used to 
construct a measure of the decedent’s “permanent” income. The 
education of the decedent was used as a measure of parental effi- 
ciency. When parental income (and hence market efficiency) is held 
constant, a higher level of parental education is interpreted as imply- 
ing a greater level of nonmarket efficiency. Additional characteris- 
tics of the decedent (age, sex, race, religion, origin [birthplace], and 
marital status) and of the recipient (age, sex, marital status, and 
number of individuals employed in recipient’s family) were included. 
Some of these variables hold constant life-cycle factors (e.g., age), 
while others may be interpreted as crude measures of some dimen- 
sions of the child’s endowment. 

Several writers have proposed that parental bequests may represent 
a payment by parents in return for services, such as care during old 
age, provided by children’® (e.g., Sussman et al. 1970; Parsons 1975d). 
If low-wage children possess a comparative advantage in the provision 
of such services, as seems plausible, an inverse relationship between 
inheritance and the recipient’s income could be observed even in the 
absence of altruism. In order to account for this possibility and in 
an effort to test this alternative hypothesis, an additional variable— 
the frequency of contact between the recipient and the decedent 
(VISTT'S)—was included in the inheritance and income equations. If 
some children sacrifice market income in order to care for their aged 


'* E.g., Sussman et al. (1970) state: “Testators will their estates to designated children 
... according to their perception of their needs, emotional ties with them, and services 
exchanged among family members over the years.” Children “generally accept the 
notion that the sibling who has rendered the greatest amount of service to an aged 
parent should receive a major portion of the inheritance” (p. 118). 
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parents in return for a larger inheritance, the VISITS variable should 
enter with a negative coefficient in the income equation and a positive 
coefficient in the inheritance equation. 

The empirical model consists of four equations in which the de- 
pendent variables are the inheritance, income, and schooling of the 
child and the total number of kin of the decedent. Since the determi- 
nants of human capital investment are expected to differ according to 
whether financial constraints are binding, estimates of the complete 
model are presented separately for the two subsamples according to 
whether the recipient inherited in excess of the limit value or the 
minimum amount. 

Finally, in accordance with the previous discussion, life-cycle factors 
suggest certain identifying restrictions. Thus the marital status of the 
decedent at death—the presence or absence of a surviving spouse—is 
assumed to influence only the inheritance and to have no inde- 
pendent effect on the remaining endogenous variables. Also, the 
marital status of the recipient (divorced or separated, married or 
single) enters the schooling, income, and inheritance equations but 1s 
excluded from the family size equation. 


IV. Empirical Results 


Table 2 reports estimates for the sample as a whole. Row | presents 
the Tobit estimate of the inheritance equation, while row 2 presents 
the corresponding two-stage least-squares (2SLS) estimate. The im- 
portant qualitative results are the same, but since the latter does not 
take account of the truncation of the dependent variable, I shall 
concentrate on the Tobit estimate. 

The important finding is that the amount inherited is inversely 
related to the recipient’s income, with an elasticity of —0.92 when 
evaluated at the mean inheritance.” This result contrasts with the 
findings of previous studies using these data.”! Before accepting this 
result as confirmation of the theoretical model, several alternative 
interpretations should be considered. First, since the right-hand-side 


20 The regression in row | omits parental education. When this variable is included, it 
is insignificant (¢ = 0.628), while the recipient’s income is on the margin of significance 
at the 5 percent level (¢ = 1.932). 

21 Brittain (1978), in an analysis of these data designed to detect differences in 
inheritance by the sex of the recipient, reported that “‘need’ for bequests indicated by 
the prebequest economic status of the son and daughter was tested, but found to have 
no influence. This is . . . consistent with equal division among siblings” (p. 45, n. 47). 
However, Brittain’s regression (pp. 42-45) included only the number of children and 
the gross estate of the decedent as regressors in addition to the recipient’s sex. Adams 
(1980) also analyzed these same data to estimate the income elasticity of bequests, but 
since the income of children was used as a measure of the decedent’s income, he did not 
separate the effects of child’s income and parent’s income on material wealth transfers. 
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variable is the recipient’s income rather than wage rate, it may reflect 
the labor supply of family members. Since the receipt of an inheri- 
tance is predicted to increase the demand for leisure, the work- 
disincentive effects of inheritance also imply an inverse relationship 
between inheritance and family earnings”? (Heckman 1976a). In 
order to evaluate this possibility, the inheritance and income 
equations were estimated simultaneously in rows 2 and 3, with the 
recipient’s income permitted to depend on the amount inherited.”? 
The inheritance enters the income equation with a significant positive 
coefficient, suggesting that the income variable (INCOMER) may 
represent a better measure of income gross of the inheritance rather 
than earnings (see n. 22). The income equation reported in row 3 
constrains the family background variables (family size, decedent’s 
income and schooling) to enter recursively via the recipient’s school- 
ing. This assumption is relaxed in row 4 to allow for the direct 
influence of these variables on recipient’s income via lifetime gifts and 
other mechanisms. The conclusions are unchanged—inheritance 
continues to enter with a positive coefficient which is on the margin of 
significance at the 10 percent level. The available evidence does not 
support the work-disincentive hypothesis. 

Second, as mentioned previously, an inverse relationship between 
children’s income and inheritance may also arise if some children 
sacrifice market earnings in order to provide care for their parents, in 
return for a larger bequest. In order to test this possibility, the VIS- 
ITS variable was introduced into both income and inheritance 
equations. This variable enters the income equation with a significant 
negative coefficient, suggesting that the opportunity cost of child- 
parent interaction is positive. Surprisingly, the VISITS variable enters 
with a significant negative coefficient in the inheritance equation.”* 


22 In a single-period labor/leisure model the percent change in earnings (E) in 
response to a | percent change in nonlabor income (V) is given by (d In E/d InV) = 
[Ky/(1 — Ky)(Sznz — 1) = [—Ky/(1 — Ky)]Sim < 0 if n> 0, where Ky = V/I,S, = wl /F, 
S72 Ge whetelis full income, J money income, / leisure, and Z consumption, and 
n; is the income elasticity of demand for good j. The percent change in money income 
resulting from a | percent change in V is (d In T/d\nV) = KySznz = Ky(1 — Sym) > Oif nz 
= 

23 In the income equation reported in row 3, the inheritance variable was predicted 
by using a Tobit regression to take account of the truncation problem. The results do 
not differ substantially when this variable is entered in the usual manner, ignoring the 
truncation at the lower limit. The income equation also contains a dummy variable for 
the presence/absence of nonlabor income (NLABINC) and the number of household 
members employed (NEMPLR), in order to hold constant differences in labor force 
participation between households. The estimated coefficient on schooling in the income 
equation is “large” in comparison with estimates of the rate of return to schooling. 
However, the dependent variable is (the log of) family income rather than individual 
earnings. If both the education levels and earnings of spouses are positively correlated, 
this could explain the large marginal effect of schooling. 

24 This conclusion differs from that reached by Sussman et al. (1970) based on the 
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This result presents prima facie evidence against the pure “child- 
services” model of inheritance (i.e., with no altruism), since the dece- 
dent as a monopsony demander of child services would never be- 
queath an inheritance such that the recipient located on the 
backward-bending region of the child-services supply schedule.2> We 
therefore conclude that the negative relationship between the amount 
inherited and the recipient’s income confirms the existence of an 
operative altruistic bequest motive and is consistent with the predic- 
tion that inherited material wealth serves as a means of redistributing 
resources among family members. 

Other findings may be briefly summarized. The decedent parent’s 
income enters the Tobit inheritance equation (row 1) with a 
significant positive coefficient, implying an income elasticity of 0.84 
when evaluated at the mean inheritance. Last, the total number of kin 
(I'NKIN)—the empirical measure of family size—enters the inheri- 
tance regression with a significant negative coefficient, consistent with 
the existence of substitution between family size (quantity) and the 
per capita bequest (quality). 

Row 5 reports a logit regression in which the dependent variable is 
the probability of receiving an inheritance in excess of the lower-limit 
value.** This equation represents a test of the ability of the model to 


study of various cases: “The data indicate general agreement that the child who 
performs the greatest amount of service should receive the greatest part of the estate” 
(p. 119). The results reported here do not invalidate this explanation of estate distribu- 
tion in specific cases. Post hoc explanations of this finding are (i) that contact with aged 
parents 1s a time-intensive and inferior commodity or (ii) that children reporting the 
greatest frequency of contact are compensated during the parental lifetime (e.g., by 
living rent free in the parental home). A more complete model would include VISITS 
as an endogenous variable. Along these lines a simple OLS “supply of visits” equation 
was estimated (t-statistics in parentheses; + = In): 


VISITS = 2.593 — .097INCOMER} — .004INHR + 356NLABINC 


(2.831) (1.021) (2.743) 
+ .790SINGLER + .560SEXR + .314NEMPLR 
(3.950) (4.786) (3.528) 
— .008INCOMEDt — .025SCHD — .138TNKIN 
(.078) (.481) (4.059) 
~ .561RACE — .282MARRIEDD + .700JEWD; 
(1.670) (2.169) (2.456) 


R? = .162, N = 605. 


The recipient's income enters with a significant negative coefficient, while the inheri- 
tance coefficient is negative but not significant. These results are subject to possible 
simultaneous-equations bias. 

*° However, this finding is consistent with a child-service model which incorporates 
parental altruism. 

*’ The logit procedure assumes the distribution of u is Sech? rather than normal. 
However, the cumulative distribution of the logistic function closely approximates the 
normal except in the tails (Nerlove and Press 1973). As expected, a probit regression 
produced similar results. 
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assign individuals to the alternative regimes?’ (see eq. [8a]). As pre- 
dicted by the theory, the recipient’s income and the number of kin 
enter with significant negative coefficients.”* In contrast to the Tobit 
results (row 1), neither the decedent’s income nor the VISITS vari- 
able is significant in the logit regression, while the decedent’s school- 
ing enters with a significant positive coefficient. 

The final row of table 2 reports a reduced-form Tobit inheritance 
equation. This regression allows us to determine the effects of pa- 
rental income and education on inheritance when family size and the 
recipient's income are permitted to vary. In this equation, the income 
elasticity of inheritance is 1.71, evaluated at the mean of the de- 
pendent variable—a value significantly different from unity at the 6 
percent level. This estimate is considerably larger than the estimate in 
row l, a result consistent with the prediction of Becker and Tomes 
(1976) that, because of the effects of quality-quantity interaction, the 
“observed” income elasticity of child quality will exceed the underly- 
ing “true” elasticity. The decedent’s schooling enters the reduced- 
form inheritance regression with a significant negative coefficient. 
This confirms the prediction that more educated parents invest more 
in their children’s human capital (see table 3) and as a consequence 
make smaller bequests of material wealth.”? The parameter estimate 
implies that each additional year of decedent’s schooling reduces the 
child’s inheritance by almost $2,000—a sizable effect in relation to the 
average inheritance of $4,257. 


Estimates of the Complete Model 


Table 3 presents estimates of the four-equation model. In these re- 
gressions the sample is partitioned into two categories according to 
whether children inherit in excess of the limit value or not. Since the 
self-selection of households into either category results in the possibil- 
ity of sample-selection bias, the correction procedure outlined by 
Heckman (1976), 1979) was implemented. This procedure involves 
initially computing Mills ratios from the probit equation of the selec- 
tion rule and including these “omitted” variables—designated “A,” 


27 As an intuitive test of ability of the logit equation to assign individuals to regimes, 
recipients were assigned to 0,1 categories according to the predicted P S .5. This 
resulted in the “correct” assignment of 73.12 percent of recipients as compared with a 
50 percent “correct” assignment based on random assignments using the population 
proportions. 

28 This first result confirms the finding of Sussman et al. (1970) that “those most likely 
to inherit were persons with the smallest monthly incomes” (p. 155). However, those 
authors also state that “the percentage of survivors who inherited did not vary greatly 
among [survivor] income categories” (ibid.). 

29 Additional empirical evidence in support of this prediction is reported in Tomes 
(19816), where it is found that the total estate of the decedent is inversely related to the 
decedent’s education, holding the decedent’s income and other characteristics constant. 
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and “d,” in table 3—in subsequent regressions using the selected 
sample.® Table 3 reports the results obtained using 3SLS, which 
allows the error terms of the equations to be correlated. 

Rows 1-8 report results for children who received a bequest in 
excess of the limit value. The estimates of the inheritance and income 
equations duplicate quite closely those obtained for the whole sample 
(table 2, rows 1-3) and require little additional discussion. Recip- 
ient’s income and parents’ income enter the inheritance equation with 
the predicted negative and positive signs, with elasticities (at the 
mean) of — 0.74 and + 0.82, respectively.*! In contrast to the results in 
table 2, inheritance is not significant in the income equation (row 2), 
and the VISITS variable is not significant in line | or 2.” 

Although the semilog specification of the inheritance equation may 
be preferred on the a priori grounds that it embodies the prediction 
of a decreasing income elasticity, the constant elasticity formulation 
was also estimated (row 5). The estimates show that the major qualita- 
tive results are not dependent on the use of the semilog functional 
form. 

The results reported in table 3 represent a direct test of the two- 
regime model. The major question to be addressed is whether the 


3° If eq. (10) represents the inheritance equation, the expected values of variables Z; 
for the censored samples can be shown (see Willis and Rosen 1979) to be 


oC 

















E(Z;|a > .250) = y;X + E(e;| BX >u) = y;X + < Ne 
E(Z;|a = .250) = y;X + E(e| BX <u) = yj;X + a XS, 
where 
he B(2| 4 < EP) = —fepxioy/F(pxioy) <0, 
dos E(2|4 = EX) = pepxsoyitt — F(Bx/oy)1 = 0, 


where F is the cumulative normal density and f is its probability density function. The 
estimation procedure employs the estimated values \, and , from a probit regression 
including all the exogenous variables except VISITS—because of the dubious 
exogeneity of this variable (see n. 24). The presence of the selection rule introduces 
heteroscedasticity into the equation residuals, resulting in upward-biased t-statistics. 
However, in the present context, there is a presumption that the bias is small. Applying 
the appropriate correction procedure to the t-statistics from the 2SLS estimates resulted 
in only marginal changes in the ¢-statistics. 

*! The significant positive coefficient on A, in row 9 indicates that individuals who 
inherit zero receive higher incomes than individuals with the same income-earning 
characteristics in the whole sample. This also supports the compensation hypothesis. 

* The income equation (row 2) constrains the background variables (family size, 
decedent’s income and schooling) to enter recursively via the recipient’s schooling. 
When these variables are entered directly (results not reported here), the major conclu- 
sions are unaffected. 
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determinants of human capital investment differ among households 
according to whether material wealth transfers are positive or zero. A 
comparison of the schooling regressions for the two subsamples (rows 
3 and 10) indicates differences which are in the predicted direction. 
For children receiving an inheritance in excess of the limit value, the 
level of schooling is unrelated to either parental income or family size 
but is positively related to parental education. In contrast, for chil- 
dren who received a zero inheritance, the level of schooling depends 
on parental income (with an income elasticity of 0.07) but is unrelated 
to parental education. Thus opportunities (parental income) are im- 
portant for households making zero material wealth transfers, while 
parental education (one determinant of ability) is more important for 
households making positive material wealth transfers. However, con- 
trary to the theoretical prediction, the family size variable—a deter- 
minant of opportunities—is not significant in the school equation for 
children who inherit zero (row 10) upon whom financial constraints 
are expected to be binding. 

The family size regressions also differ between the two subsamples 
(rows 4 and 11), although concerning these differences we have no 
predictions. For families making asset transfers in excess of the limit 
value (row 4), parental income has a positive and significant co- 
efficient—the only evidence in these data of a positive income elas- 
ticity of family size. Parental education enters with a negative coeffi- 
cient which is significant at the 10 percent level. In contrast, for the 
sample receiving a zero inheritance (row 11), the parental income 
variable is negative and not significant, while parental education en- 
ters with a significant positive coefficient. These results underline the 
complexity and ambiguity of the relationships among family size and 
parental income and education. The one result common to both 
subsamples is that measures of child quality (the child’s inheritance 
and/or schooling) enter with significant negative coefficients in the 
family size equations, consistent with the existence of a quality- 
quantity trade-off.** 


Additional Results 


A major finding of the foregoing analysis is that, when parental 
income and education are held constant, children with higher in- 
comes receive a smaller inheritance of material wealth. This is consis- 
tent with the theoretical prediction that parents respond to dif- 


33 The significant positive coefficients on A, in rows 4 and 8 imply that those who 
receive a positive inheritance are more likely to come from a smaller family than 
random individuals with the same characteristics. This also implies a quality/quantity 
trade-off. 
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ferences in the ability of children by reducing transfers to children in 
order to finance greater parental consumption. In this way, parents 
share in the returns to greater child endowments. However, an alter- 
native interpretation is that the observed inverse relationship between 
inheritance and children’s income reflects (partly or entirely) the role 
of inheritance in compensating for differences among siblings rather 
than between parents and children. Thus, if parents bequeath a 
larger share of the estate to the child with the lowest income and a 
smaller share to his advantaged sibling, this also implies an inverse 
relationship between inheritance and the recipient’s income. 

This issue can be addressed using data on 137 families for whom 
information is available on more than one child. In this subsample, 
41.6 percent of heirs (siblings) received exactly equal shares. How- 
ever, this figure includes cases in which all heirs received the limit 
value of zero and therefore, by definition, inherited equally. Exclud- 
ing such cases, 21.1 percent of heirs inherited exactly equal shares, 
and the vast majority of heirs (78.9 percent) received unequal shares 
in the parental estate. However, some of these between-sibling dif- 
ferences are small: 50.4 percent of heirs received approximately equal 
shares—within $500 of the average inheritance received by children. 
Nevertheless, the incidence of equal division is dramatically lower 
than that found by Menchik (1980) in his sample of large Connecticut 
estates. Menchik found that “equal sharing among children is the 
rule” (p. 299). For example, among two-child families, in 62.5 percent 
of cases both children received exactly equal bequests and in 70.5 
percent of cases approximately equal bequests (Menchik 1980, p- 
310). The corresponding figures for two-child families in the present 
sample are 22.2 percent and 44.4 percent.** The fact that strictly 
equal division is not the rule in this sample suggests that the possibility 
that inheritance is equalizing between siblings warrants investigation. 

Table 4 reports the results of Tobit regressions for various subsam- 
ples of children from families with information on multiple heirs. The 
availability of multiple observations per family also permits the inclu- 
sion of an unobserved family component in the income prediction 
equation. The constructed measure of the decedent’s income em- 


**'Two possible explanations of this difference are: (i) In large estates donors may 
make sizable transfers in the form of inter vivos gifts in order to avoid estate and 
inheritance taxes. Thus, unequal lifetime gifts by wealthy donors may serve the same 
functions as unequal bequests by less wealthy donors. (ii) Apparent equal division in 
probate records is consistent with de facto unequal division when estates are redistrib- 
uted among heirs. Sussman et al. (1970) report that, in 27 percent of intestate cases 
(i.e., the decedent left no will) for which the law prescribed equal division among 
siblings, ex post redistribution among siblings occurred resulting in unequal shares (pp. 
143-44). Thus probate records may underestimate the incidence of unequal division. 
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ployed in table 4 includes this family component.* In these regres- 
sions, both the mean income of children and the deviation of the 
child’s income from the family mean are introduced as determinants 
of inheritance. The latter variable is predicted to enter with a negative 
coefficient if disadvantaged children receive larger shares in the pa- 
rental estate. This variable enters with the predicted negative sign but 
never attains statistical significance. In contrast, the mean income of 
children enters consistently with a highly significant negative sign. 
Estimates of the elasticity of inheritance with respect to the mean 
income of children cluster closely around — 1.8, evaluated at the mean 
of the dependent variable. Thus, when we hold income inequality 
among children constant, the higher the average income of children, 
the lower the inheritance received by children. These results suggest 
that the inverse relationship between recipient’s income and inheri- 
tance reflects for the most part a reallocation of resources between the 
generations of the family rather than simply the redistribution among 
-siblings of a given total bequest to children. 

These results for the subsample for which information is available 
on multiple heirs and also those for the subsample with information 
on both the decedent and surviving spouse*® (not reported here) 
confirm the results reported earlier for the whole sample. 


Vv. Summary and Conclusions 


I have presented an empirical model of the intergenerational trans- 
mission of inequality within the altruistic family. This paper adds 


35 Adams (1980) employed a conceptually similar, but less appealing, technique using 
the mean income of recipients in a family to measure the decedent’s income. The 
procedure employed here was to estimate an income prediction equation similar to that 
in table 1, §, for the subsample for whom information is available for two or more 
survivors—either children or the surviving spouse (N = 431). The mean residual was 
then computed for each family and entered in a subsequent income equation. The 
estimated coefficients from this second regression, together with the decedent’s char- 
acteristics and the family mean residual, were used to construct the measure of the 
decedent’s income. The weight attached to the family residual depends on the extent to 
which the unobserved family component is correlated between spouses and transmitted 
from parents to children. In the absence of information on this parameter, weights 
were used in the unit interval (0.0, 0.1, ..., 1.0). The reported results employ the weight 
which minimized the standard error of the regression (see note to table 4). In practice, 
the measure of the decedent’s income used in tables 2—3 performed almost as well as 
the measure used in table 4. (See the results reported in the earlier version of this 
paper.) The estimated income elasticity of inheritance did not differ in a systematic 
fashion using the two alternative income measures. 

36 For this subsample (N = 191), the effects of mother’s and father’s education, ages, 
and incomes were analyzed; also, a measure of the predicted bequest from the surviv- 
ing spouse to children was constructed and used in the subsequent regressions. The 
results, which confirm the basic conclusions of this study, were reported in the earlier 
version of this paper. 
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empirical content to the theoretical models of Becker, Blinder, and 
Ishikawa in which bequests of material wealth and investment in 
children’s human capital are simultaneously determined and repre- 
sent substitute modes of parent-child transfers. 

Empirical tests of the model were performed using a unique data 
set containing information on the inheritance, income, and schooling 
of the surviving kin of a random sample of estates probated in the 
Cleveland, Ohio, area in 1964-65. 

The empirical results strongly confirm the equalizing role of in- 
heritance and the existence of substitution between human capital 
investment and inherited material wealth. The inheritance received 
by the child was found to be inversely related to the child’s income. 
Material wealth transfers are therefore “compensatory”—in that 
(other things being equal) children with low incomes receive greater 
bequests of material wealth than their better-endowed contem- 
poraries. Our estimates imply that, for individuals in the sample who 
received a positive inheritance (table 3, row 1), the compensatory 
component of bequests reduced the variance of inherited wealth by 
30 percent and reduced the correlation between inheritance and 
income from 0.53 to 0.12. This result confirms the prediction of 
the altruism model of intergenerational transfers that an increase 
in the income of recipients leads the donor to reduce transfers in 
order to redistribute family resources in favor of his own consump- 
tion. The data do not support the alternative explanations of this in- 
verse relationship—that the receipt of an inheritance has a work- 
disincentive effect, that low-income children inherit more in return 
for the greater child services they provide for their aged parents, or 
that the observed relationship reflects the reallocation among siblings 
of shares in a given parental estate. Differences across families in 
parental education were found to result in variations in inherited 
wealth. Greater parental education is associated with more schooling 
and higher incomes of children and lower bequests of material 
wealth—results consistent with the theoretical predictions of the al- 
truism model. 

There is evidence that the financial constraints faced by families 
differ across households. Families who make positive material wealth 
transfers appear less constrained financially when it comes to invest- 
ing in their children’s human capital—and ability variables are the 
major determinants of children’s schooling. Conversely, families 
making zero bequests to children appear more constrained finan- 
cially, so that opportunity variables are the major determinants of 
children’s schooling. For both categories of households, there is evi- 
dence of a trade-off between family size (quantity) and per capita 
parent-child expenditures (child quality). 
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The first result is important for a number of reasons. Previous 
empirical work (Brittain 1978; Menchik 1980) has failed to find any 
evidence of compensatory bequests among siblings. In contrast, the 
present study provides strong evidence that within the family— 
between parents and children—and among families in the same 
income/education stratum, the distribution of inherited wealth is 
equalizing. This supports the recent suggestion of Griliches (1979), 
that the family contributes less to the intergenerational transmission 
of inequality than is commonly supposed, since it operates as a force 
for equality between successive generations (parents and children). 


Appendix 


MEANS OF VARIABLES FOR VARIOUS SAMPLES 








Sons/Daughters Sons/Daughters 





in col. 1 Receiving in col. 1 
an Inheritance Receiving 
All Sons/Daughters in Excess of an Inheritance 
Reporting Positive the Limit Equal to the 
Family Income Value Limit Value 
(N = 605) (N = 354) (N = 251) 
Variable (1) (2) (3) 
INHR ($000s) 4.257 7.099 .250 
INCOMER ($000s) 11.518 11.922 10.973 
SCHR 12.570 12257 12.645 
AGER 42.828 44.384 40.633 
SEXR (% female) 49.3 48.6 50.2 
NEMPLR 1.190 1.184 1.199 
NLABINC 29.8 ile Deo 
(% reporting the 
receipt of nonlabor 
income) 
SINGLER (%) 9.6 9.0 10.4 
DIV/SEPR (%) 5S “3 4 
INCOMED ($000s) 6.085 5.559 6.906 
SCHD 9.102 9.054 ona 
AGED 70.630 72.056 68.618 
SEXD (% female) 41.7 52.5 26.3 
MARRIEDD (%) 58.8 59.5 86.1 
TNKIN 4.134 3.791 4.618 
OTHERKIN 1.020 .743 1.410 
RACE (% nonwhite) 3.3 Dui 5.6 
CATHD (%) 44) 44.9 43.0 
PROTD (%) 43.6 41.5 46.6 
JEWD (%) 5.1 6.2 3.6 
ORIGINUS (%) 51.6 452 60.6 
ORIGINWE (%) 16.2 18.1 13.5 
VISITS 1.443 1.550 e292, 
Na O90 —.506 905 


Xp 1.068 IESIL9 713 
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Intraindustry Specialization and 
the Gains from Trade 


Paul R. Krugman 


Massachusetts Institute of Technology 


Several recent empirical studies of trade suggest that interindustry 
specialization and trade, which reflect the conventional forces of 
comparative advantage, are also accompanied by ¢ntravndustry 
specialization, which reflects scale economies and consumers’ taste 
for a diversity of products. This paper develops a simple model 
which illustrates this argument. Two main results are developed. 
First, the nature of trade depends on how similar countries are in 
their factor endowments. As countries become more similar, the 
trade between them will increasingly become intraindustry in 
character. Second, the effects of opening trade depend on its type. If 
intraindustry trade is sufficiently dominant, the advantages of ex- 
tending the market will outweigh the distributional effects, and the 
owners of scarce as well as of abundant factors will be better off. 


Over the years, many empirical students of international trade have 
argued that trade among the industrial countries cannot adequately 
be explained by conventional theories of comparative advantage. One 
might summarize this empirical critique by pointing to three as- 
pects of world trade which seem to contradict received theory. First, 
much of world trade is between countries with similar factor endow- 
ments. Second, a large part of trade is intraindustry in character—that 
is, it consists of two-way trade in similar products. Finally, much of the 
expansion of trade in the postwar period has taken place without 


This research was supported by a grant from the National Science Foundation. An 
earlier version of this paper, “International Trade and Income Distribution: A Recon- 
sideration,” was presented at the NBER Summer Institute in International Studies, 
Cambridge, Massachusetts, July 1979. 
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sizable reallocation of resources or income-distribution effects. This 
last point is particularly noticeable in the cases of the EEC and the 
North American automobile pact. 

The purpose of this paper is to formalize one possible explanation 
of these seeming paradoxes. The explanation is not a new one: It is 
essentially the same as that put forward by Balassa (1967), Grubel 
(1970), and Kravis (1971), among others. What this paper does is put 
the argument in terms of a formal model, a step which may be of 
some help in clarifying and disseminating ideas which have been “in 
the air” for some time. 

Briefly, the argument of these empirical workers, a very clear ex- 
position of which is given by Kravis (1971), runs as follows. The 
conventional forces of comparative advantage operate on groups of 
products (“industries”) and thus give rise to interindustry specialization 
and trade. Economies of scale in production, however, lead each 
country to produce only a subset of the products within each group, 
so that there is also intraindustry specialization and trade. This pro- 
vides a simple explanation of two of our empirical ostensible 
paradoxes. Countries with similar factor endowments will still trade 
because of scale economies, and their trade will be largely intraindus- 
try in character. The third seeming paradox—the apparent painless- 
ness of some trade liberalization—can also be resolved if we argue 
that income-distribution effects are outweighed by the gains from a 
larger market when countries are sufficiently similar. 

While this is a simple and straightforward explanation, however, it 
is not so easy to formalize. Scale economies are crucial to the argu- 
ment, and they are notoriously awkward to handle in general equilib- 
rium models. In this paper I follow an earlier paper (Krugman 1979) 
and use the device of Chamberlinian monopolistic competition. As in 
the earlier paper, this proves to be a very convenient approach, yield- 
ing a simple and tractable model. The structure of this model and the 
determination of this model's equilibrium in a closed economy are set 
forth in Section I. Section II shows how the pattern of trade between 
two countries is determined in the model, developing the basic re- 
lationship between differences in factor endowments and the extent 
of intraindustry trade. Section III then examines the effects of trade 
on income distribution and shows how the extent of intraindustry 
trade determines whether scarce factors of production gain or lose 
from trade. Finally, Section IV summarizes the results and discusses 
some implications for theory and policy. 

It must be emphasized that the model presented here is in no sense 
a general one. In addition to making strong assumptions about func- 
tional forms of cost and utility functions, I impose a great deal of 
symmetry on the model to simplify the analysis and give a natural 
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meaning to the concept of “similarity” in factor proportions. Thus the 
results of the analysis are at best suggestive. Nonetheless, they seem 
intuitively plausible and also seem to have something to do with actual 
experience. 


I. The Model in a Closed Economy 


Intraindustry trade depends on the existence of unexhausted econo- 
mies of scale in production. The main problem in modeling this kind 
of trade is how to handle these scale economies, which must lead to a 
breakdown of perfect competition (unless they are wholly external to 
firms). In this paper, as in an earlier paper (Krugman 1979), I will use 
the device of Chamberlinian monopolistic competition, basing the 
model on recent work by Dixit and Stiglitz (1977). An “industry” will 
consist of a large number of firms, all producing somewhat differ- 
entiated products, all operating on the downward-sloping parts of 
their average cost curves. There will be two-way international trade 
within an industry because firms in different countries will produce 
different differentiated products. What prevents countries from pro- 
ducing a complete range of products domestically is the existence of 
fixed costs in production; thus scale economies are the basic cause of 
intraindustry trade. 

We should note at the outset that the concept of an industry used in 
this paper is a somewhat special one. One might want to define an 
industry either as a group of products which are close substitutes on 
the supply side or as a group of products which are close substitutes 
on the demand side. In the model of this paper, I assume that there 
are two groups of products which fit both definitions. Products within 
each group are closer substitutes than products in different groups, 
while factors of production are assumed mobile among products 
within each group yet immobile between groups. This convenient 
coincidence of the two possible concepts of an industry may or may 
not be empirically reasonable; it is certainly not theoretically necessary 
and should be regarded as one among many special assumptions. 

Another conceptual difficulty concerns the notion of a “product.” 
In the formulation below, all products seem to look alike, since they 
enter symmetrically into cost and utility functions. This may seem to 
involve an illégitimate comparison of physical quantities of different 
goods. I show in the Appendix, however, that the formulation of 
many “identical” products can be interpreted as a restriction on the 
parameters of a model in which products really do differ. 

Let us begin, then, with a two-industry model of a closed economy. 
Each industry consists of a large number of products, all of which 
enter symmetrically into demand, with the two industries—industry 1 
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and industry 2—themselves playing symmetric roles. All individuals 
will have the convenient utility function. 


N1 1/0 N2 1/0 
u =in(Y et +In (>. (1) 


i=1 j=1 


OSOr=a 1) 


where ¢,; is consumption of the ith product of industry 1; c2,; is 
consumption of the jth product of industry 2; and N, and N, are the 
(large) numbers of potential products in each industry. Not all poten- 
tial products will necessarily be produced, and we will in fact assume 
that the actual numbers of products produced—n, and n,—while 
large, fall short of N, and N3. 

The utility function (1) has several useful properties. First, it en- 
sures that half of income will always be spent on industry 1’s products. 
Second, if the number of products in each industry is large, it implies 
that every producer faces a demand curve with elasticity 1/(1 — 6). 
Finally, (1) will allow us to represent the gains and losses from trade in 
a particularly simple way. 

On the demand side, then, an industry is assumed to consist of a 
number of products which are imperfect substitutes for one another. 
On the supply side, however, they will be assumed to be perfect sub- 
stitutes. There will be only two factors of production, type 1 labor and 
type 2 labor, each of which is wholly specific to an industry but 
nonspecific among products within an industry. Thus, type 1 labor 
will be used only in industry 1, type 2 only in industry 2. Within each 
industry, the labor required to produce a particular product will 
consist of a fixed setup cost and a constant variable cost; 


Li=art Bry, 12H ee eas 
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where /,; is labor used in producing the ith product of industry, 1; x,; 
is the output of that product; and so on. To go from these required 
labor inputs to nominal costs, we must multiply by the wage rates of 
the two types of labor, w, and wy. 

To close the model, we begin by noting that output of each product, 
x, is the sum of individual consumptions of the product. At the same 
time, total employment in each industry is the sum of employment in 
producing all the individual products. Assuming full employment, we 
have 
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Thus the total labor force is set equal to 2, with the parameter z 
measuring factor proportions. As we will see below, z will assume 
crucial significance in determining the importance of intraindustry 
trade and the effect of trade on income distribution. 

We are now prepared to examine the determination of equilibrium 
in this model. This involves determining how many products are 
actually produced in each industry, the output of each product, the 
prices of products, and the relative wages of the two kinds of labor. 
We should note at the outset that it is indeterminate which products 
are produced—but it is also unimportant. 

Our first step is to determine the pricing policy of firms. We assume 
that producers can always costlessly differentiate their products. This 
means that each product will be produced by only one firm. If there 
are many products the elasticity of demand for each product will, as 
already noted, be 1/(1 — 6). (This is proved in the Appendix.) Thus, 
each firm will face a demand curve of constant elasticity. We then 
have the familiar result that the profit-maximizing price will be mar- 
ginal cost plus a constant percentage markup: 


Pi = 6"Bw,, (4) 
p2 = 9 "Bw, 


where p, and p,» are the prices of any products in industry 1 and 2, 
respectively, which are actually produced. 
Given the pricing policy of firms, actual profits depend on sales: 


Or 2k. (3) 


W1 = prs — (a + Bx,)w,, (5) 
TW. = PK. — (a + Bx2)We, 


where x, and x, are sales of representative firms in the two industries. 
But in this model there will be free entry of firms, driving each 
industry to Chamberlin’s “tangency solution” where profits are zero. 
Thus we can use the condition of zero profits in equilibrium to 
determine the equilibrium size and number of firms. Setting 7; = 72 
= 0 and using (4) and (5), we have 
a 6 
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for the size of firms. The number of firms can then be determined 
from the full-employment condition: 


ty = (21 2) (a4 Bx,), (7) 
Ny = z/(a + Bxo). 


The final step in determining equilibrium is to determine relative 
wages. This can be done very simply by noting that the industries 
receive equal shares of expenditure and that, since profits are zero in 
equilibrium, these receipts go entirely to the wages of the industry- 
specific labor forces. So w,L; = w2Ll2, implying 


w/w, = z/(2 — z). (8) 


We now have a completely worked out equilibrium for a two-sector, 
monopolistically competitive economy. It is indeterminate which of 
the range of potential products within each industry are actually 
produced, but since all products appear symmetrically, this is of no 
welfare significance. The character of the economy is determined by 
the two parameters z and 6. The value of z determines relative wages: 
If z is low, type 2 labor will receive much higher wages than type 1 
labor. The value of 6 measures the degree of substitutability among 
products within an industry. The lower is 6, the more differentiated 
are products, and the more important are unexploited scale econo- 
mies. From (4) we have 0 = Bw,/p, = Bw2/po. But Bw, and Bw, are the 
marginal costs of production, while in equilibrium price equals aver- 
age cost. Thus @ is the ratio of marginal to average cost (which is also 
the elasticity of cost with respect to output). 


II. Factor Proportions and the Pattern of Trade 


In the last section we saw how equilibrium can be determined in a 
simple closed-economy model with scale economies and differ- 
entiated products. We can now examine what happens when two such 
economies trade. What we are principally concerned with is the prop- 
osition, advanced in the introduction, that countries with similar fac- 
tor endowments will engage in intraindustry trade, while countries 
with very different endowments will engage in Heckscher-Ohlin 
trade. 

As a first step we need a working measure of the extent of intrain- 
dustry trade. The empirical literature on intraindustry trade (e.g., 
Hufbauer and Chilas 1974; Grubel and Lloyd 1975) generally con- 
centrates on an index of trade overlap, that is, 


ae (> = Mi\)/| De 2 My} (9) 
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where X;, is a country’s exports in industry & and M, is imports in that 
industry. This index has the property that, if trade is balanced indus- 
try by industry, it equals one, while if there is complete international 
specialization so that every industry is either an export or import 
industry, it equals zero. As we will see, this index fits in quite well with 
the model of this paper. 

The other concept we need to make operational is that of similarity 
in factor endowments. In general, this is not well defined. What I will 
do in this paper, however, is consider a special case in which the 
concept does have a natural meaning without trying to arrive at a 
general definition. 

Let us suppose, then, that there are two countries, the home coun- 
try and the foreign country. The home country will be just as de- 
scribed in Section I. The foreign country will be identical except for 
one thing: The relative sizes of the two industries’ labor forces will be 
reversed. That is, the foreign country will be a mirror image of the 
home country. If we use a star on a variable to indicate that it refers to 
the foreign country, we have 
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Obviously, given this pattern of endowments, we can regard z as an 
index of similarity in factor proportions. If z = 1, the countries have 
identical endowments. As z gets smaller, the factor proportions be- 
come increasingly different. 

The mirror-image assumption can be given a geometric interpreta- 
tion. In figure 1, an Edgeworth box is used to represent the interna- 
tional distribution of productive resources. The origin O is used to 
measure home country endowments, O* to measure foreign endow- 
ments. he two diagonals of the box can then be given economic 
interpretations: OO* is a line along which factor proportions are equal 
in the two countries, while the other diagonal is a line along which the 
countries are of equal economic size. The mirror-image assumption is 
saying that the endowment point E lies on this diagonal. The 
parameter z then determines the position of E; as z goes from 0 to 1, E 
moves from the corner to the center of the box. 

Suppose, now, that these countries are able to trade at zero trans- 
portation cost. As before, we can determine pricing behavior, the size 
and number of firms, and relative wages. In addition, we can deter- 
mine the volume and pattern of trade. 

The first point to note is that the elasticity of demand for any 
particular product is still 1/(1 — 6). This gives us price equations 
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exactly the same as before: 
pus 6 *Bw,, 
pe = 0 'Bw2, (11) 
Pi = acon 
pk = 0 "Bus. 


Now, however, the symmetry of the setup insures that all wages will 
be equal, both across industries and internationally: 


WwW, = w¥ = w. = wi. (12) 


The zero-profit condition will determine the equilibrium size of 
firm, x, which will be the same for both industries in both countries: 


x = a6/B(1 — 6). (13) 


Finally, full employment determines the number of firms in each 
industry in each country: 


N= Ts = 2 ie) eX) 


nN, = n¥ = z/(a + Bx). 


(14) 


What these results show is that trade will lead to factor price 
equalization while leaving the pattern of production unchanged. Our 
remaining task is to determine the volume and pattern of trade. We 
can do this by noting two points. First, everyone will devote equal 
shares of expenditure to the two industries. Second, everyone will 
spend an equal amount on each of the products within an industry. 
This means that the share of all individuals’ income falling on, say, 
industry 1 products produced in the foreign country is ¥% - [n#/(n, + 
n¥)|—that is, the industry share in expenditure times that country’s 
share of the industry. But the number of products is proportional to 
the labor force. Thus, if we let Y be the home country’s income 
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(equal to the foreign country’s), X; be exports of industry 1 products, 
X, be exports of industry 2 products, M, be imports of industry 1 
products, and M, be imports of industry 2 products, we have 


Age kee 2) 2 |, 
Xo = WY - (2/2), 
M, = WY - (2/2), 
M, = WY - [(2 — z)/2]. 


(15) 


Now, the relations (15) have two important implications. First, 
consider the volume of trade. Total home country exports are X, + X, 
= 2Y. Thus the ratio of trade to income is independent of z, the 
index of similarity in factor proportions. This can be regarded as an 
answer to the first ostensible empirical paradox mentioned in the 
introduction—the large volume of trade among similar countries. In 
this model, similar countries will trade just as much as dissimilar 
countries. 

The second seeming empirical paradox was the prevalence, in 
trade among similar countries, of two-way trade in similar products. 
If we substitute (15) into our expression for intraindustry trade (9), 
we get a simple, striking result: 


I =x. (16) 


The index of intraindustry trade equals the index of similarity in factor 
proportions. 

These results may appear to depend crucially on the assumptions of 
this model, but in qualitative terms they can survive a good deal of 
generalization. The persistence of trade between countries with simi- 
lar factor endowments will occur in almost any model with economies 
of scale. The relationship between similarity of countries and the 
extent of intraindustry trade can be shown to hold, for an appropriate 
definition of similarity, in a much more general model and has also 
been noted in a quite different context by Ethier (1979). Insofar as 
these insights are concerned, the virtue of this model is not in the 
difference of its conclusions but in the clarity with which they emerge. 

Where the special assumptions of this model become particularly 
useful, however, is in attempting to deal with the welfare conse- 
quences of trade. These consequences are considered in the next 
section. 


III. Gains and Losses from Trade 


In this section we must again begin by delineating a concept which I 
have been using loosely. This is the idea of the “seriousness” of 
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distribution problems. What we need is a clear way of formulating the 
notion that distribution problems from opening trade will not be 
serious, if countries are sufficiently similar in factor proportions that 
the trade which results is primarily intraindustry trade. 

The criterion I will use to define nonserious distribution problems 
is the following: Distribution problems arising from trade will be held 
not to be serious if both factors gain from trade. This, of course, begs 
some questions, since there may be difficulties in getting groups to 
accept a relative decline in income even if they are absolutely better 
off. But this criterion is fairly reasonable and turns out to give 
suggestive results. 

To find out whether factors gain from trade, we need to know how 
utility depends on the variables of the model. Suppose an individual 
receives a wage w and has the utility function (1). He will then spend 
w/2 on the products of each industry and divide his expenditure 
equally among the products within an industry. Thus his utility will 
depend on his wage, the prices of representative products in each 
industry, and the number of products available: 


U = In [n,(w/2n,p,)°F? + In [no(w/2nopo)?}? 


(17) 
J : Z ln; bs 
The function (17) has the convenient property that all the effects 
enter additively. Utility depends on real wages in terms of represen- 
tative products and on diversity. 
To analyze the effects of trade on welfare, it is useful to introduce 
some more notation: 








=—21n 2 + In wip; + Inewipy + In ny. 


U,,U, = utility of workers in industries 1 and 2; 


W11, Wz = real wage of industry 1 workers in terms of products of 
industries 1 and 2; 


W21, W22 = real wage of industry 2 workers in terms of products of 
industries 1 and 2. 


Then we can substitute into (17) to get (suppressing the constant 
term): 


U, = Inw,, + Inwy,, + 2 
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We are now in a position to measure the welfare effects of trade. 
Suppose we start from a position of autarky, as in Section I, then 
move to free trade, as in Section II. There will then be two kinds of 
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effects. First, there will be a distribution effect as factor prices are 
equalized. As one can easily verify, labor’s real wage remains the same 
in terms of the products of its own industry while rising or falling in 
terms of the other industry’s products, depending on whether the 
factor is abundant or scarce. Thus, in the home country this effect 
benefits labor in industry 1 and hurts labor in industry 2. 

The second effect comes from the increase in the size of the market, 
which makes a greater variety of products available. This works to 
everyone’s benefit. 

Since both effects work in its favor, the abundant factor must be 
made better off. This leaves us with the problem of determining the 
change in utility of the scarce factor—industry 2 labor in the home 
country and the symmetrically placed industry 1 labor in the foreign 
country. 

Let a prime on a variable indicate its free-trade value while un- 
marked variables refer to autarky. Then, as we move from the au- 
tarky solution in Section I to the free-trade solution in Section II, the 
change in U, is 


be Le 0 


In ni/n, + In nj/n5 
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where the first term is negative and represents the distribution loss; 
the remaining terms are positive and represent the gains from being 
part of a larger market. The question is under what conditions these 
terms will outweigh the first terms. 

By collecting terms, we can rewrite (19) as 


20-1 ] 


ey) = 9 Inz ~~, In2—z+ In 2. (20) 
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This gives us one immediate result: If 6 < 0.5, the scarce factor 
necessarily gains from trade, since the first term will be positive and 
the third term will outweigh the second. Recall that 6 is a measure of 
the substitutability of products within an industry. What this result 
then says is that of products are sufficiently differentiated, both factors gain 
from trade. 

If 6 > 0.5, whether both factors gain depends on the extent to 
which trade is intraindustry in character, which in turn depends on 
how similar the countries are in factor proportions. When 6 > 0.5, the 
function (20) has three properties: (i) as z approaches 1, U; — U, goes 
to [(2 — 26)/0@] In 2 > 0; (ii) as z goes to zero, Uz; — U2 goes to minus 
infinity; and (iii) VU; — U, 1s strictly increasing in z,. Thus, if we were to 
graph (20), it would look like figure 2. There is a critical value of z, Z, 
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for which U} — U, = 0. Ifz > Z, both factors gain; if z < Z, the scarce 
factor loses. But z is our measure of similarity in factor proportions. 
Thus what we have shown is that if countries have sufficiently sumilar 
factor endowments, both factors gain from trade. 

What is particularly nice about this result is that we have already 
seen that there is a one-for-one relationship between similarity of 
factor endowments and intraindustry trade. So this result can be 
taken as a vindication of the arguments of such authors as Kravis 
(1971) and Hufbauer and Chilas (1974) that intraindustry trade poses 
fewer adjustment problems than interindustry trade. 

We should note, however, that the critical value of interindustry 
trade depends on the substitutability of products. The function (20) is 
decreasing in 9: 0(U; — U,)/00 = 6 Inz(2 — z) < 0. So an increase in 6 
will shift the function down. This will increase z. The less differentiated 
are products, the more similar countries must be if both factors are to gain from 
trade. In the limit, as 6 goes to 1, so does z. 

The results of this section are summarized in figure 3. On the axes 
are the two parameters @ and z, both capable of taking on values 
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between zero and one. What we have shown is that the qualitative 
effects of trade depend on where we are in the unit square. In the 
southeastern part of the square—labeled “conflict of interest”—either 
scale economies are unimportant or countries are very different in 
factor endowments, and scarce factors lose from trade. In the other 
region—“mutual benefit”—the gains from intraindustry specializa- 
tion outweigh the conventional distributional effects, and everyone 
gains from trade. 


IV. Summary and Conclusions 


This paper began with three “paradoxes” about international trade. 
Since they do not seem paradoxical in the light of this model, perhaps 
we should state them as “stylized facts”: (i) Much of world trade is 
between countries with similar factor endowments. (ii) The trade 
between similar countries is largely intraindustry in character; that is, 
it consists of two-way trade in similar products. (iii) The growth of 
intraindustry trade has not posed serious income-distribution prob- 
lems. 

This paper offers a simple model which formalizes one possible 
explanation of these stylized facts. According to this view, the variety 
of products produced in any one country is limited by the existence of 
scale economies in production. Thus similar countries have an incen- 
tive to trade; their trade will typically be in products produced with 
similar factor proportions; and this trade will not involve the 
income-distribution effects characteristic of more conventional trade. 

In addition to helping make sense of some puzzling empirical 
results, this paper is, I hope, of some interest from the standpoint of 
pure theory. The model dispenses with the two most fundamental 
assumptions of standard trade theory: perfect competition and con- 
stant returns to scale. Instead, I have dealt in this paper with a world 
in which economies of scale are pervasive, and all firms have 
monopoly power. While the model depends on extremely restrictive 
assumptions, it does show that it is possible for trade theory to make at 
least some progress into this virtually unexplored territory. 


Appendix 
I. The Concept of a Product 


In the formulation in Section I, an industry was assumed to consist of many 
products with the “same” cost function and entering in the “same” way into 
utility. This may seem to involve a comparison of apples and oranges. How- 
ever, it can be justified as a restriction on the parameters of a more general 
model. 
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Consider the utility and cost functions for a one-industry model (the 
generalization to two industries is obvious): 


N 
6 
v=[>@co}"0<0<1, (AL) 
i=1 
L=a,+ BX, 2 — 9) TL (A2) 


Here we allow goods to enter with different weights into utility and to have 
different cost functions; thus no assumption is made about comparability of 
units. Given certain restrictions on parameters, however, it is possible to 
choose units so that a formulation where all products appear identical is valid. 
Let us suppose first that a; = a@ for all 7. The measurement of this cost is 
independent of the choice of units, so this is a meaningful assumption. Let us 
also assume £,/8; = 6 for all i. This again does not depend on units of 
measurement; measuring product 27 in batches of 10 instead of individual 
units will increase both By; and 6, by a factor of 10 and leave the ratio 
unchanged. 

If the assumptions about parameters are granted—and they are special 
assumptions, not general properties—we can justify the model in the text by a 
choice of units. Let C; = §;C; for all 7. Then the utility and cost functions 
become 


u=(de0)" (A3) 


lg= = BX, Slee, (A4) 
II. Elasticity of Demand for Individual Products 


The analysis in Section I depends on the result that the elasticity of demand 
for any particular product is 1/(1 — 6). This Appendix gives a demonstration 
of this. 

Consider an individual maximizing his utility function (1) subject to a 
budget constraint. The first-order conditions from that maximization will 
have the form 


cri) 
pia = ——"———__.,, i= 1 n 
ea ’ te ome by A 
Deed ne 
—(1—-4@) 
pe —— Co, J ~ 1 n 
oJ ’ = 922 2 9 U9, 
Ss 


where A is the shadow price on the budget constraint, that is, the marginal 
utility of income. 

If there are many products, however, the firm producing a particular 
product can take the denominators of these expressions as given. Thus each 
individual’s demand for a particular product, and therefore also market 
demand, will have elasticity 1/(1 — 4). 
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The Adjustment of Consumption to Changing 
Expectations about Future Income 


Marjorie A. Flavin 
University of Virginia 


The paper analyzes the role of current income in providing new 
information about future income and thus signaling changes in 
permanent income. Using time-series analysis to quantify the revi- 
sion in permanent income induced by an innovation in the current 
income process, a structural econometric model of consumption is 
developed. The rejection of the joint rational expectations— 
permanent income hypothesis is both statistically and quantitatively 
significant. The paper also shows that the test of the rational 
expectations—permanent income hypothesis proposed by Hall is 
based on the reduced form of this structural model and reconciles 
Sargent’s consumption paper with Hall's. 


Because the path of future income is uncertain, the individual’s per- 
manent income and, therefore, his consumption plans will be revised 
period by period as new information about future income becomes 
available. Under the assumption that individuals hold rational ex- 
pectations of future income, the stochastic properties of these 
period-by-period revisions in permanent income can be exploited in 
order to formulate testable restrictions on the joint time-series be- 
havior of consumption and other observable variables. 

This paper first reviews the econometric tests of the permanent 
income-rational expectations hypothesis proposed by Hall (1978) and 
Sargent (1978), then develops and executes an alternative approach 
to testing the hypothesis. Section I formally defines permanent in- 
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come and derives the basic implication of the permanent income- 
rational expectations hypothesis: If consumption is proportional to 
permanent income in each period, and permanent income, in turn, 
represents the best estimate, given currently available information, of 
the individual’s lifetime resources, then current consumption should 
diverge from consumption in the previous period by the amount of 
the contemporaneous revision in permanent income. 

The published papers of Hall (1978) and Sargent (1978) both 
analyze this predicted relationship between consumption, lagged con- 
sumption, and the revision in permanent income, conditional on 
lagged information sets.1 The two papers reach conflicting empirical 
conclusions, Hall confirming and Sargent decisively rejecting the 
permanent-income hypothesis. Section II argues that an incorrect 
definition of permanent income used in Sargent’s formulation of the 
restrictions is the source of the discrepancy and shows that Sargent’s 
analysis, once the definition of permanent income is corrected, leads 
to the same parameter restrictions tested by Hall. 

The restrictions tested by Hall and Sargent apply to regressions in 
which all of the right-hand-side variables are lagged at least 1 period 
with respect to the dependent variable, consumption. By analyzing 
the expectation conditional on lagged information sets, their ap- 
proach makes use of the property that the revision in permanent 
income should be uncorrelated with lagged variables. The Hall- 
Sargent tests can be thought of as tests based on a reduced-form 
consumption equation, since a reduced form expresses the expecta- 
tion of an endogenous variable, conditional on predetermined vari- 
ables. 

This paper takes an alternative approach to testing the permanent 
income-rational expectations hypothesis by explicitly specifying a 
structural model of consumption. The model developed in Section III 
analyzes the role of current income in providing new information 
about future income and thus signaling changes in permanent in- 
come. The model is intended to underscore the point that the ob- 
served sensitivity of consumption to current income should not be 
interpreted simply as the behavioral marginal propensity to consume 


1 Bilson and Glassman (1979) consider some extensions on the model which I 
develop in this paper; Bilson (1979) reports comparative estimates of the model applied 
to data for the United States, United Kingdom, and Germany. Barro (1978) has also 
addressed the question of whether certain variables affect consumption only through 
their role in predicting future income. Recent empirical work on consumption by 
Blinder (1981) incorporates explicit treatment of the expectations of future income in 
the permanent-income model. Blinder’s analysis concentrates on measuring the re- 
sponse of consumer expenditure to temporary changes in taxes rather than testing the 
permanent-income hypothesis itself. Hayashi (1979) also tests the permanent-income 
hypothesis and finds that consumption is more sensitive to current income than pre- 


dicted by the hypothesis. 
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out of current income. Because income is a fairly highly serially 
correlated process, the fluctuations in current income will be corre- 
lated with fluctuations in permanent income. Formally, the model 
uses an autoregressive-moving average (ARMA) representation of 
the income time series in order to quantify the magnitude of the 
revision in permanent income implied by the contemporaneous ob- 
servation of current income. The forecast error, or innovation, in the 
income time-series process represents the new information contained 
in the current observation on income. Based on the realization of the 
current innovation in the income process, individuals revise their 
expectations about future income. The magnitude of the revisions in 
expectations of future income in response to the realization of the 
current innovation depends on the parameters of the ARMA repre- 
sentation of the income process. Therefore, the revision in perma- 
nent income in a given period will be proportional to the innovation 
in current income, with the factor of proportionality determined by 
the parameters of the income time-series process. Following this ap- 
proach, one can specify a structural equation relating the change in 
consumption to the contemporaneous revision in permanent income 
(modeled using the income innovation) and the change in current 
income. The coefficient of the change in income is a measure of the 
behavioral marginal propensity to consume out of current income, 
since the role of current income in signaling changes in permanent 
income has been explicitly modeled. The permanent income-rational 
expectations hypothesis can then be tested by testing whether the 
marginal propensity to consume out of current income is significantly 
different from zero. Having developed a test of the permanent 
income-rational expectations hypothesis in the context of a simple 
structural econometric system, it is easy to show that the Hall-Sargent 
test is the equivalent test based on the reduced form of the system. If 
the structural system is just identified, the test based on the structural 
system and the Hall-Sargent test based on the reduced form yield 
numerically identical values of the test statistic for the permanent 
income-rational expectations hypothesis. While the structural ap- 
proach to testing the hypothesis is not statistically more powerful than 
the reduced-form approach, it is considerably more illuminating, 
since it permits the recovery of the structural parameters of the 
system. 

The empirical results indicate that the observed sensitivity of con- 
sumption to current income is greater than is warranted by the per- 
manent income-life cycle hypothesis, even when the role of current 
income in signaling changes in permanent income is taken into ac- 
count. The restrictions implied by the permanent income-rational 
expectations hypothesis can be rejected statistically at very high 
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confidence levels. Further, the estimates of the marginal propensity to 
consume out of current income are quite large, indicating that the 
failure of the permanent-income hypothesis is quantitatively, as well 
as statistically, significant. 


I 


In its simplest form, the permanent-income hypothesis holds that 
individuals ought to gear their consumption to their lifetime re- 
sources rather than to their measured income in some arbitrary time 
period. To start with a particularly simple case, consider the con- 
sumption behavior of an infinitely lived individual in a world in which 
the return to real wealth is certain and constant over time. The total 
resources available to the individual for current and future consump- 
tion consist of current real wealth, w,, plus current and future labor 
income, x;45, 5 = 0, 1, 2,...,. Labor income is assumed to be 
exogenously determined. The individual’s lifetime resources can be 
represented in either stock or flow form—the stock form being net 
worth, or the sum of real wealth and the present discounted value of 
current and future labor income, and the flow form being permanent 
income, or the annuity value of net worth. 

Since the path of future income is not known with certainty, the 
individual must make his consumption plans on the basis of some set 
of expectations about future income. Given the expectations about 
future income held in period t, the individual’s permanent income 
can be expressed as 


ye = r|wy + ie ; =) Besse (1) 


In this discrete time formulation, w, represents real wealth at the 
beginning of period ¢; x, is assumed to be paid at the end of the 
period; and the real rate of return, 7, is assumed constant. 

Permanent income can be thought of as the constant resource flow 
which, conditional on expections in period ¢, can be sustained for the 
remainder of the individual’s time horizon. Thus, permanent income 
ought to have the property 





Eyes = S- (2) 


To verify that permanent income, as specified in equation (1), has 
this property, note that real wealth will evolve according to 


Wer, = (LE mw, tx, mae: (3) 


if the individual’s consumption is identically equal to his permanent 
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income in each period. Therefore, y?,, will be related to y? according 


to the equation: 


2 l st+1 
Were r{( aa T Ui pete Xen ats Ds A a -] Evixersti| (4) 
s=0 





Rewriting equation (4) to separate out the component based on 
information which becomes available after time ¢ and rearranging 
gives: 


Me 





Gee ra + rf in F ay) Boxes] — yp 


s=0 





(5) 
2 s+1 
if my (| (Bet Edxesses} 
Substituting in (1/r)(yP) for w, + 29 [1/1 + 1) B+ Exp, gives: 
= s+1 
Veiga oat a? 2 ( ; A (Esty — Et) Xt+s+1- (6) 


If the expectations of future income are rational, the expectation of 
next period’s revision in expectation, (E44, — E;)x14341, is zero. Thus, 
Evytu = x 

Allowing for a stochastic, or transitory, component of consumption, 
the consumption function for the representative individual becomes 
Ce = ye + UW, OF 





a l s+1 
Ce TW. ( Exxs| + Ut, (7) 
: i > aan: 


where the error term u; represents the transitory component of con- 
sumption. Solving for c,4,; in terms of ¢;, given the wealth constraint 
Wey = Wier aig a) Xe ec ives: 


Shei 
Cee Cf ( ) 
ms Ler 


Note that the coefficient of u,; is not —1 but pple en?) mOLlictly 
speaking, permanent income and consumption will evolve as random 
walks only if the transitory consumption term is identically zero, uw, = 
0. Since permanent income represents the constant flow which can be 
sustained without changing net worth (in the infinite horizon case), 
consumption in excess of y? in a given period reduces next period’s 
net worth by w and the annuity value of net worth, Yb, by rug: 


8+ 


1 
(a Eke Crus Uri. (8) 





oo 


l S+1 
em Veet e aed (Ets: — Es)Xttst1 — Tug. (9) 


s=0 
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The remainder of the paper discusses two approaches to testing the 
permanent income-rational expectations hypothesis. Both ap- 
proaches take as their starting point the predicted relationship ex- 
pressed in equation (8) between consumption, lagged consumption, 
and the revision in the expected lifetime budget constraint attribut- 
able to new information available since last period. 


II. A Reconciliation of the Hall and Sargent Papers 


Sargent substitutes average discounted expected future income, (1 — 
a) 23-6 aE 145, where a represents the discount factor, 1/(1 +r), into 
the consumption function c, = By? + wu, to yield (p. 687): 


2 
ce = BL — a) > aE gig; + wy. (10) 
j=0 
This section argues that average future income is not an appropriate 
definition of permanent income and that once the definition of per- 
manent income is corrected, Sargent’s analysis leads to the same 
parameter restrictions as those tested by Hall. Briefly retracing Sar- 
gent’s derivation of the restrictions, average future income for date t 
is expressed as a weighted average of current income and average 
future income for date t + 1: 


& = B\( —a)y + (1-a)a Y WE er s| Pru (11) 


j=0 

Since consumption in period ¢ + 1 will be proportional to average 

expected future income in period ¢t + 1, according to Sargent’s 
specification, this equation can be written equivalently as 


= B[(1 —a)y + a St it |] + (12) 


Or 
Ce = AE ty, + BIL — a)y, + my — aE ys, (13) 


(Sargent’s eq. [18]). 

Equation (13), projected on information available at time ¢ — 1, is 
the restriction imposed across the autoregressive representations of 
consumption and income: 


— 1 
Fe-1Ct41 = = Eet = pL f, i Etna + Fy. (14) 


Rather than restrict the marginal propensity to consume (MPC) out 
of permanent income to be one, as in Section I, Sargent uses the more 
general consumption function c, = By? + u;. The basic point—that 
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discounted future income is not the appropriate definition of perma- 
nent income—is not affected by relaxing the restriction that B = 1. In 
order to accommodate Sargent’s more general consumption function, 
the analysis of Section I will be generalized to allow for an unre- 
stricted MPC. 

Based on the definition of permanent income as the annuity value 
of net worth, equation (1), and the consumption function with unre- 
stricted MPC, c;,, will be related to ¢ as follows: 


] 


Ca. = (1 +r(1 —B)lc, + rB ee 


Stout 
(EA Ext 15) 


= (1 Se r)Ut a Ut41- 


If B # 1, consumption will be a random walk with drift instead of a 
simple random walk. Taking the expectation in period t — 1 gives: 


Been 1) Pr — Bed — Ue perth, ae tig aller a (16) 


In comparing equation (16) with Sargent’s equation for E;,_,¢;4, (14), 
remember that l/a = 1 + 7. 

The presence of the term B(1 — a)E;_,y; in Sargent’s equation for 
Ey-1¢141 1s an artifact of his definition of permanent income as average 
future income. Sargent’s income series, yz, is disposable income and 
therefore includes future nonlabor income as well as future labor 
income. However, the present discounted value of the future path of 
nonlabor income is not equal to current nonhuman wealth. Consider 
the definition of permanent income as the annuity value of net worth: 


2S 1 st1 
y =r lw + eae E,x | (17) 
[ ; Ps l+r oe 


where x;;, 1s future labor earnings. The nonhuman wealth term, w,, 
could be expressed as the present value of a future stream of earnings 
rather than as a current stock. Nonhuman wealth w,, if maintained at 
that level forever, would yield a stream of nonlabor income of rw, each 
period. Thus, 





2 1 s+1 
wes iDig | rw. (18) 


s=0 


Note, however, that the nonlabor income in the present value term is 
rw,, the future nonlabor income the individual would receive each 
period 7f nonhuman wealth were maintained at wy. Expanding equa- 
tion (18) to isolate actual nonlabor income in each future period, ru;,,, 
gives 
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S ] s+1 = l Sail 
W, = aa a) Tee were) + Ds (3 : 7 ries (19) 





Equation (19) distinguishes between two conceptually different future 
revenue flows associated with w,; The second term is simply the 
present value of future nonlabor earnings, which of course depends 
on the path of nonhuman wealth over time. The first term reflects 
receipts from the sale of the wealth’s principal and payments to 
increase the principal. By defining permanent income as discounted 
future income, Sargent includes the second term but omits the first. 
Since the individual will systematically accumulate and decumulate 
nonhuman wealth over time in order to smooth his planned con- 
sumption path, the first term in equation (19) will not be identically 
zero or even constant over time. It is the omission of the term 2%, 
[1/1 + r)]}§*4r(w, — wi4s) from his definition of permanent income 
which is responsible for the discrepancy between Sargent’s version of 
the restrictions implied by the permanent-income hypothesis (eq. 
[14]) and the corrected version of the restrictions (eq. [16]). 

Sargent’s procedure is to impose the restrictions across the 
parameters of the vector autoregression of (c;, y;) implied by equa- 
tion (14). The unrestricted autoregression of (c;, y,) 18: 


BiG = C1Ve-1 1 Coleg F ... + Cn Ven 
gC Sais 5 Ciep she Ase C en (20) 


and 


Fest = Q1e-1 + Goyy-g +... + AnYt—n 


de Diep Vapor. « + Dyes (21) 


BINCe EC a ees ee Cae ee ee Vener aie) 
HGet 4h) os 6 rd C;44, substituting out forsE 47y,;and Fy24¢, 
yields an equation for E;y_,¢;4, in terms of y,_; and c_;,7 = 1, n, and 
the a;, b;, c;,d; parameters. Substituting these autoregressive represen- 
tations for Fy ,c;, Er1)yz, and E,_,¢c;4, into equation (14) gives a set of 
restrictions across the q;, b;, c;, d; parameters. These restrictions are 
tested by estimating equations (20) and (21) with and without the 
cross-equation restrictions imposed and computing a likelihood ratio 
statistic. 

Following Sargent’s procedure, but substituting the autoregressive 
representations of F,_,c; and F;_,¢y4; into equation (16), the “cor- 
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rected” version of equation (14), provides the following set of restric- 
tions on the parameters of the (c, y;) vector autoregression: 


C4, + Co fo GiCy ae [1 ae r(1 a B)\c, 


Cia, Pend, 6-7 = Serle Bes) 
oa, + a,c, =r’ Bie (22) 
66, + di +d, =[1+7r(1 —B)l]d, 


CORE ete Gide te dn = [1 =F r(1 5 B)]dn_; 
60,3 did, = (1-7 (1 — Byid,. 


This modified set of restrictions could be tested, following Sargent’s 
procedure, with a likelihood ratio test. However, rather than test the 
restrictions in their present form, I will argue that the time-series 
implications of the rational-expectations and permanent-income 
hypotheses can be stated and tested in a simpler form. 

The restrictions in equation (22) are based on the projection of 
equation (15) on information known at timet — 1. Projecting equation 
(15) on information known at time ¢ instead gives 


Er = (1 + 71 — B)lee — (1 + rue + Eptess, (23) 


which is the time-series restriction tested in Hall’s paper. Hall assumes 
that transitory consumption, u;, is identically zero. Therefore, as for- 
mulated by Hall, the testable hypothesis implied by (23) is that other 
variables dated ¢ — 1 or earlier have no additional explanatory value 
in predicting c; if c,_, is included in the regression. That is, equation 
(23) implies that 

d, =[1+r(1 —B)], 

Gr=05l 2 (24) 


Cp= 00 =8ly 3-2 0 


in the consumption autoregression (eq. [20]). The coefficient of 
lagged consumption, dj, is estimated unrestricted. Presumably d, will 
be close to one. 

The parameter values given in (24) satisfy the modified version of 
Sargent’s cross-equation restrictions (eq. [22]): If the parameters of 
the consumption autoregression (c;, d;) are set equal to the values 
specified in (24), the equation restrictions stated in (22) all reduce to 
the form 0: a = 0 or 0: 6 = 0, so that the restrictions are satisfied for 
arbitrary values of the parameters of the income process (a;, b;). That 
iS, imposing the restrictions d;= 0,4 = 2,....,nyand¢, =0,4= 1). =, 
n, and estimating the income autoregression unconstrained would 
satisfy the restrictions in equation (22). 
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Note that Sargent’s procedure yields restrictions in the form of a set 
of equations—one equation for each parameter of the consumption 
autoregression. In Hall’s paper, however, the parameters of the con- 
sumption autoregression are constrained to particular values—lagged 
variables other than the first lagged value of consumption are con- 
strained to enter with coefficients of zero. While correcting the 
definition of permanent income makes the two sets of restrictions 
consistent, in the sense that the set of parameter values in (24) satisfies 
the equation restrictions in (22), there remains the issue of why the 
two sets of restrictions are not identical in form. 

The set of equation restrictions in (22) is derived by modeling E,_,¢; 
as a general (unrestricted) autoregressive process, then solving for the 
restrictions across the parameters of the consumption and income 
autoregressions which are implied by equation (16). However, since 
equation (23) is also an implication of the rational-expectations and 
permanent-income hypotheses, no additional assumptions are re- 
quired to restrict the consumption autoregression to a first-order 
process. If the parameters of the consumption autoregression are 
assigned the values implied by equation (23), the equation restrictions 
in (22) all reduce to the form 0 = 0. That is, once the restrictions are 
imposed within the consumption autoregression, no further restric- 
tions are implied by the set of restrictions in (22). Thus the difference 
in form of the two sets of restrictions, (22) and (24), is attributable to 
the fact that, in deriving the restrictions in (22), E,_1¢; is modeled as an 
unnecessarily general autoregressive process. 

At this point, the conflict between the Hall and Sargent papers 
concerning the restrictions on the systematic part of the consumption 
autoregression has been resolved in favor of Hall’s formulation. The 
two authors also differ in their execution of the test, Sargent estimat- 
ing the autoregressions with quasi-differenced data and Hall es- 
timating the autoregression with the data in levels. 

Referring to equation (15), note that the disturbance in the regres- 
sion of c, on ¢,_, and other lagged variables will include a term which 
represents the revision in permanent income in period ¢ as well as a 
moving average of transitory consumption. Using &, to represent the 
revision in permanent income, 


oe 1 st+1 
é, = 7B near (Ee Ete: (25) 
: 2 l+r E 


the consumption autoregression, under the null hypothesis, is: 
Cr = [1 Sia r(1 = Byler Ar Et me (1 3 TL) + Ut- (26) 


Hall assumes that transitory consumption, w;, is identically zero. In his 
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analytical discussion, Sargent assumes u, is an ARI process, u, = 
pu;-; + @,, and quasi-differences (26): 


Ca PC pea cel Le FCB Ce ee 063) 


tS DS each @); (lace r)@y-1- eo 


In the econometric specification of the test, however, Sargent assumes 
that the disturbance in the consumption autoregression is uncorre- 
lated with the regressors (p. 689). The disturbance will be uncorre- 
lated with the regressors only if transitory consumption is identically 
zero, since w; is correlated with c¢,_,. Quasi-differencing the data in- 
troduces a moving average of the revision in permanent income term, 
é,. Since Sargent in effect assumes away transitory consumption by 
asserting that the error is uncorrelated with regressors, it seems more 
sensible to leave the regression in levels and thus preserve the white- 
noise property of the é, error term. 

The results of tests of the corrected version of the restrictions are 
presented in table 1. Sargent’s Y, income variable was constructed as 
Y, = GNP — capital consumption — state, local, and federal tax receipts 
+ transfers; his consumption variable C was total personal consump- 
tion expenditures. Both series are seasonally adjusted and measured 
in constant (1972) dollars. For comparison, table 1 also includes simi- 
lar tests using Hall’s income and consumption variables, which were 
per capita personal disposable income and per capita expenditures on 
services and nondurable goods. Hall’s data were also seasonally ad- 
Justed and stated in 1972 dollars? Following Sargent’s specification, 
all regressions included a linear time trend, denoted t. 

Sargent conducted his tests with the data in first differences, having 
assumed the autoregressive parameter in the transitory consumption 
process to be unity. Using differenced data, the test statistic for the 
corrected restrictions was 19.492, which indicates rejection of the 
hypothesis at the 99 percent confidence level but not at the 99.5 
percent level. Testing the incorrect restrictions, Sargent reported (p. 
696) a marginal confidence level of 1.000 when B, the MPC out of 
permanent income, was constrained to equal .90. The marginal 
confidence level of the test was only .697 when B was unrestricted; 
however, the resulting point estimate was “highly implausible” (6.06 
oa LOe), 

For the reasons stated above, I think that it is more appropriate to 
run the regressions in levels than in differences. With the data in 


* While the data set used to rerun Hall’s regressions was constructed according to his 
basic definitions of the variables, it is not an exact reproduction of his data set. The 
data set used in table 1 differs from Hall’s because it incorporates subsequent NIPA 


revisions of the latest observations and because a different price deflator was used to 
deflate the income series. 
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levels, the test statistic drops to 17.054, which is significant at the 97.5 
percent level. The difference in sample periods does not affect the 
results much. 

The results are sensitive to the definition of consumption, however. 
Using Hall’s data, which omit expenditures on durable goods from 
the consumption variable, the test statistic drops to 12.04, which is 
barely significant at the 90 percent level. Thus the difference in the 
consumption variables used by the two authors accounts for a major 
part of the gap between Hall’s results and those reported by Sargent. 
Correcting the restrictions and omitting the autoregressive transfor- 
mation of the data does have a noticeable effect, though, decreasing 
the marginal confidence level from the 1.000 reported by Sargent to 
97.5 percent. 


II. The Role of Current Income in Signaling Changes in 
Permanent Income 


The model in this section is based on the notion that, in forming their 
expectations of future income, rational economic agents will exploit 
the fact that income is a stochastic process which exhibits a high 
degree of serial correlation. Based on the observation that the current 
realization of income is greater than anticipated, agents will revise 
their expectations of future income, and therefore their permanent 
income, upward. Further, the magnitude of the revision in perma- 
nent income associated with the realization of a disturbance in current 
income will depend on the parameters of the time-series representa- 
tion of current income. In this section, the quantitative relationship 
between an innovation in current income and the revision in perma- 
nent income is derived for a general ARMA representation of the 
income process. 

This section uses the innovation in the income process to model the 
period-by-period revisions in permanent income and develop a sim- 
ple structural econometric model of consumption. The implications 
of the permanent income-rational expectations hypothesis are tested 
in the context of the structural model. The Hall-Sargent test of the 
permanent income-rational expectations hypothesis is then shown to 
be a test based on the reduced form of the structural model developed 
here. 


The Income Process 


More formally, the time-series behavior of aggregate income is mod- 
eled as an ARMA process which is driven by a series of independent 
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shocks or “innovations” in the process.? Thus, 


Ve = Pye i Payt-3, © - -o Pye 1, i,t 
PrEr-1 + Po» +... + fia, (28) 


or y, = [1 + M(L)]/[1 — A(L)]e, where L is the lag, or backshift, 
operator; A(L) = pit pol ee tL? ME) — "ob f+ oo? +... + 
o@,L%; and «€ is a white-noise disturbance. The income variable is ex- 
pressed in deviations from trend. 

An ARMA model of order (f, g) can be equivalently characterized 
as a pure moving average model of infinite order. That is, the time- 
series representation 


Ve Cpt Wree1 + Wo€s2 + WG-3 +... (29) 
is numerically equivalent to the model 


Ye = Prye-1 + Pot-o +... + Ppye-p + & 
F Oi€p-1 +. Poo 1 + Paet—g .(50) 
for Ws = os + Dae Pie 5. 
Expressed in the pure moving average form, the time-series model 
summarizes the relationship between the forecasted value of the vari- 
able and the realization of the disturbance term: 


dyt+5 as dy, ee 
ee aaa dees = We. (31) 


Since the disturbance terms are white noise by construction, the 
expectation of all future values of € is zero. When the current forecast 
error €; is realized, the predicted value of income at each date in the 
future is revised proportionally to «;: 


Etyt+s a Ev t+s = 1,67, (32) 


The present value of the revision, at time ¢, in the anticipated path 
of future income is 


> (a5) Boe: — Ey-1t4+s) = > (as) tole (33) 


3 It should be emphasized that the paper uses the ARMA framework to quantify the 
extent to which income is serially correlated only because the framework is a simple way 
of describing the time-series behavior of income on the aggregate level. The treatment 
of aggregate income as a stable ARMA process is not meant to imply that individuals 
view income as pure time series in forming expectations about their future income. On 
the individual level all that is assumed is that consumers will exploit the serial correla- 
tion that characterizes their individual income streams. For instance, a student who is 
unable to find a summer job realizes that summer jobs will probably be scarce next year 
as well; the production worker who is currently laid off realizes that he may not be 
rehired for some weeks; the stockbroker whose commissions have declined in volume 
may expect his income to remain below normal for 4—6 quarters. 
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In the infinite horizon case, the value of >» [1/(1 + r)}*W, can be 
calculated directly from the parameters ¢,, p; of the ARMA (p, 9) 
representation of the process. It is shown in Appendix I that 


eo 
ts ey: 
eg aaa oa a 


s=0 








Treatment of Unantictpated Capital Gains 


In the simplest version of the model discussed in Section I, the return 
to real wealth, 7, was assumed to be constant. This assumption implies 
that there are no unanticipated capital gains or losses on nonhuman 
wealth. In the absence of unanticipated capital gains, future labor 
income is the only component of permanent income which is not 
perfectly predictable. In this case the period-by-period revisions in 
permanent income would be modeled as proportional to the innova- 
tions in labor income alone. 

As an empirical matter, however, unanticipated capital gains and 
losses on nonhuman wealth probably constitute a significant fraction 
of the revisions in permanent income which this model is trying to 
capture. The following discussion describes the construction of an 
approximate measure of unanticipated capital gains on nonhuman 
wealth. 

Conceptually, the unanticipated capital gain on nonhuman wealth 
could be defined as the present value of the revision in the expected 
earnings associated with the current asset holdings: 


1 
l+r 





22 s+1 
capital gain in period ¢ + 1 = S | (Ert1 — Ex)rissw, (35) 
s=0 
where r,,, is the rate of return to capital in period ¢t + s. The actual 
path of nonlabor income, T1+sW+5, Will reflect both the “exogenous” 
movements in the return to capital, r44;, which we want to capture, 
and the endogenous changes in earnings flows which result from the 
individual’s decisions to accumulate or decumulate nonhuman 
wealth, w;,,. I assume that changes in the rate of return to capital, r;4., 
are quantitatively more important than the endogenous changes in 
W1+s in determining the time-series properties of the observed path of 
nonlabor income. With this assumption, unanticipated capital gains 


are approximated as the present value of the revision in expected 
nonlabor income. 
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Repeating the derivation of the consumption equation as in Section 
I, the unanticipated capital gains in real wealth enter the equation 
symmetrically with the present value of the revision in expected labor 
income: 


Sarl 


bp = 0254 i >| : ) (Ey — Ey-1) (X45 + vhs)| 





l+r 
(36) 


— (1 + Fit, + U;, 


where x; is future labor income and y*,, is future nonlabor income. 
Substituting into this equation the time-series characterization of 
the revision in the predicted path of future income yields: 





Si 1 \s*1 : 
=C1T 7 = ate als 
Ct t-1 | > i+ | se ( r uy 1 Ut, (37) 
where #, are the parameters of the moving average representation, 
and e, the innovations, of the disposable income series, y; = x; + yf. To 
simplify the notation, let ® = LZA1/(1 + 7r)Ist'bs. Using this nota- 
tion, the present value of the revision, at time ¢, in the anticipated path 
of future income is denoted ®e,; the revision in permanent income is 
denoted by r®e,. Equation (37) can be rewritten as: 


C= Cy IDG — (lit u,=, Au; (38) 


Finally, it is necessary to consider the effect on aggregate consump- 
tion of a trend in per capita income. While an individual’s permanent 
income can be defined so that its movement over time is trendless, the 
presence of a trend in per capita income will induce a trend in 
aggregate permanent income and therefore in aggregate consump- 
tion. If per capita income has a positive trend, due to increasing 
productivity, for example, later generations will have greater 
lifetime wealth than earlier ones. As the older generations die and are 
replaced by the younger generations, aggregate consumption will 
trend upward at the same rate of growth as per capita income. No 
change in the expectation of future income is involved in this trend 
movement of aggregate consumption, nor does any particular indi- 
vidual change his level of consumption. Since the model is intended to 
explain revisions in planned consumption which are caused by 
changes in the expectations about future income, equation (38) 
applies to the movement of consumption around a trend attributable 
to the trend in per capita income. 
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A basic empirical implication of the model developed in Section I is 
that, even if the behavioral marginal propensity to consume out of 
current income is zero, consumption should respond to innovations in 
current income because these innovations provide new information 
about future income and therefore induce revisions in permanent 
income. Expressing the income forecast error in terms of current and 
lagged income yields the following system of equations: 


Ye = Pi + Pie-1 + Pi He + Poyt—-p + €1t, (39) 
Nop te ERD Opa it py yet — 2 Pv) t-p) 
+ BoAy, + Bi Ayy_y + BoAyi2 +... + Bo-1A¥~@—1) + En, (40) 


where 





1 cs Pi P2 Pp 


WS Gs ues al a @lsery 


r is the quarterly interest rate, and k represents the annuity rate. (If 
the individual's time horizon is finite, the annuity rate will not be 
equal to the interest rate, as in the infinite horizon case.) 

In the unrestricted version of the model, the first difference of 
consumption responds to current and lagged changes in income as 
well as the innovation in the income process. In equation (40) the B; 
coefficients are measures of the “excess sensitivity” of consumption to 
current income, that is, sensitivity in excess of the response attribut- 
able to the new information contained in current income. The em- 
pirical work discussed in this section provides estimates of the B; 
parameters and tests the implication of the permanent income— 
rational expectations hypothesis that consumption exhibits no excess 
sensitivity to current income, B; = 0. 

The permanent income-rational expectations hypothesis could be 
tested by estimating the system in its structural representation, 
equations (39) and (40), by full information maximum likelihood 
(FIML) both with and without the restriction B; = 0 and forming a 
likelihood ratio statistic. 

As discussed below, the unrestricted version of the System is just 
identified. Because the system is Just identified, the maximum likeli- 
hood estimates of the structural parameters can be recovered from 
the estimated coefficients of the reduced form. Working with the 
reduced form has several advantages: First, estimation of the reduced 
form is computationally simpler than estimating the structural repre- 
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sentation by FIML; second, analysis of the reduced form clarifies the 
exact relationship between the likelihood ratio test of 8; = 0 proposed 
here and the F-test proposed by Hall. It will be shown that Hall’s test is 
a test based on the reduced form of the consumption equation (40). 
Further, it will be shown that because the structural system (eqq. [39] 
and [40]) is just identified, the test statistic from Hall’s test, based on 
the reduced form of the consumption equation alone, is numerically 
identical to the test statistic for the hypothesis 8; = 0 in the two- 
equation structural system. For the purpose of determining the statis- 
tical significance of a departure from the permanent income-rational 
expectations hypothesis, both Hall’s test and the test proposed here 
yield identical answers. For the purposes of assessing whether or not 
an observed departure from the hypothesis is of quantitative 
significance, however, the test proposed here has the advantage of 
permitting the recovery of the point estimates of the structural 
parameters, the 8;, and their individual standard errors, whereas 
Hall's procedure estimates only reduced-form coefficients. 


Identification of the Model 


Before discussing the identification of the system, I will make explicit 
my assumptions concerning the properties of the income-forecast 
error, €,, and the disturbance in the consumption equation, €. 
Conceptually, one would like to estimate the 1-period-ahead fore- 
cast of income, conditional on all available information, E(y,| 724): 
Since the estimated prediction equation (39) includes only a very 
limited subset of the available information, the predictive value of all 
the lagged values of variables observed by the individual, but not 
explicitly incorporated in the regression, will be reflected in the error 
term, e€,. Using the notation R(L)x,_, to represent the discrepancy 
between the 1-period-ahead forecast of income conditional on the 
complete information set J,_, and the income forecast conditional on 
the limited information set actually included in the regression, R(L)x;_, 


= E(y:| Te) - E(y:|ye-1, ++ 5Jt-p), the disturbance term e, in equation 
(39) will be the sum of the “true” innovation é,, and the R(L)x;,_, term, 
Ei, = €y + R(L)x,_4. (41) 


The term R(L)x;_, can be thought of as measurement error due to the 
omission of elements of the available information set from the fore- 
casting equation. The R(L)x,_,; term is assumed to be serially uncorre- 
lated and uncorrelated with the explanatory variables included in the 
income-prediction equation. By definition, the “true” innovation €, is 
uncorrelated with the R(L)x,, term as well as the included ex- 
planatory variables. 
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Since the “true” income innovation, €;; = Yt — Piyi-1 — +++ — Pryt-p — 
R(L)x;,_1, is the explanatory variable in the consumption equation, the 
consumption disturbance €2; will contain the term —k®R (L)x;_,, where 
k® is the coefficient of the true innovation. 

The systematic part of the consumption equation models the re- 
sponse of consumption to the new information contained in the 
current observation on income. Obviously there are many sources of 
new information about future income in addition to the current 
observation on income itself. These omitted current variables which 
induce individuals to revise their expectations about future income 
will also contribute to the disturbance in the consumption equation. 
Let the variable 6, represent the revision, in period ¢, in permanent 
income induced by other (nonincome) new information about future 
income. The “omitted news” disturbance 6; is assumed to be serially 
uncorrelated and uncorrelated with lagged variables. 

Because consumption is one component of current income, the 
omitted news term will have a nonzero correlation with the true 
innovation in income, €,,. If, in period ¢, current variables other than 
income induce individuals to revise their forecasts of future income 
upward, 6; will be positive. The positive realization of 0; increases 
current consumption and therefore increases current income. This 
implies that the omitted news term 9, is positively correlated with the 
true income innovation, €,;. 

In addition to the measurement error term, —k®R(L)x;_,, and the 
omitted news term, 6;, the consumption equation will, in principle, 
also include a moving average of transitory consumption, n, — (1 + 
k)n;-1. For the purpose of estimating the model, however, the tran- 
sitory consumption component of the error term will be ignored. 
Even though an individual’s transitory consumption in a given quar- 
ter may be nonnegligible in relation to his permanent consumption, 
the per capita value of transitory consumption will be small relative to 
permanent per capita consumption if the individual realizations of 
transitory consumption are largely independent across the popula- 
tion. I think that it is reasonable to assume that the quantitative 
contribution of the transitory consumption term to the overall error 
€.; is small compared to the contribution of the measurement error 
term, —k®R(L)x,;_,, and the omitted news error term, 6. 

This discussion leads to the following characterization of the struc- 
tural disturbances: 


€y, = R(L)xy-y + Ext (42) 


and 


E€ot = —KOR(L ioyeg + O;. (43) 
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Based on these assumptions, €, and e, are contemporaneously corre- 
lated, 


COV (€17, €¢) = —k® var [R(L)xp_1] + cov (O;, E12), (44) 


but not serially correlated. 

In order to simplify the discussion, identification of the system will 
be analyzed for a first-order AR income process. The generalization 
to a longer AR process is straightforward. 

The unrestricted system is: 


Yt = Ba + Pye-a + Ey (45) 
and 
AG; = Pe =i k®e,, = BAy, +h €or, (46) 


where ® = 1/{1 — [p/(1 + r)]}. The value of the parameter k, which 
represents the marginal propensity to consume out of wealth, is 
imposed a priori. The parameter ®, which represents the change in 
wealth associated with an innovation in income, is a function of the 
interest rate and the autoregressive parameters of the income process. 
The value of the interest rate is imposed a priori, so the identification 
of p identifies ® as well. 
Since €1, = y; — py;_1, the consumption equation can be rewritten as: 


Ac, = Me + (kB + Bese + pip — 1)yy-1 + €54. (47) 
The covariances of the structural disturbances will be denoted as 
follows: var (€,,) = 03; var (€,) = 0%; and cov (E14, €24) = Cy. 
The parameters of the income process, 444, p, and o?, are identified 
by the income data alone: 


COV (Yt, Ye-1) = p var (y;), (48) 
var (y) = p? var (y,) + o, (49) 

and 
ae lig 0) py. (50) 


where py, is the mean of y. 
Using the fact that €,, is by construction orthogonal to y, and €,, is 
assumed uncorrelated with y,_,, 


cov (Ac, ¥¢-1) = Bip — 1) var yy, (51) 
cov (Ac, y¢) = B(p = l)y'cov (Yt, Ye-1) + (RAB + Bo? + O42, (52) 


var (Ac;) = B?(p — 1)? var (y,) + (k® + B)?0? 
tahoe Boys + as) (53) 
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and 


M2 = Macy He Bd = P) Muss (54) 


where Mac, 1S the mean of Acy. 

The model is just identified—equations (48) through (54) constitute 
a system of seven equations in seven unknown parameters. Because 
the model is just identified, indirect least squares—that is, estimating 
the model as a reduced form, then transforming the reduced-form 
coefficients to recover the structural parameters—is equivalent to 
FIML. 

Consider the following reduced form: 


Yt = Bi + PYe-1 + Vit (55) 
Ac, = Be + B(e — 1)ye-1 + Vee, (56) 


where vy, = €, and vy = (kD + Bex + €. The covariance matrix of 
the reduced form, written in terms of the structural covariances, is: 


o? (kD + B)ot + Ov 67) 
ms a + Boz +o (kD + Bo? + ARP + B)ow + 03] 


Note that the maximum likelihood estimates of the structural 
coefficients—p,, M2, p, B—are independent of the value of k®, the 
parameter which measures the predicted response of consumption to 
the innovation in current income. That is, the parameters fy, Me, Pp, 
and £ are identified by equations (48), (49), (50), and (54), none of 
which contains k®. 

Examination of equations (53) and (54), or, equivalently, the 
reduced-form covariance matrix, equation (57), indicates that a priori 
restriction on the value of k® makes it possible to recover the struc- 


tural covariances oj, and o3 from the estimated covariances of the 
reduced form: 


COV (Vit, Vat) = (kD ate B)o% + O71 (58) 


and 
var (vo) = (k® + B)’o? + 2k® + B)ors + OF. (59) 


Since indirect least squares and FIML are equivalent in the just- 
identified case, the likelihood ratio statistic for the test of the restric- 
tion B = 0 computed by estimating the structural form, equations (45) 
and (46), by FIML will be numerically identical to the likelihood ratio 
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test statistic for B = 0 computed by estimating the reduced form, 
equations (55) and (56), by multivariate least squares.* 


Comparison of the Test of B; = O with Hall’s Test 


In the equation system actually estimated, income was modeled as an 
eighth-order pure autoregressive process: 


JE I Peay Poet eet Psye-s + E12, (60) 
Acy = fe + kD(y; — pry — Pit-1 — P2yt-2 — - - - — Pegyt-s) 


(61) 
ae BoAy: a B,Ay-1 ct oaehes ate BAyt-7 + Ep. 


Like the simpler system discussed above, this system is just 
identified. If a less general alternative hypothesis were specified —for 
example, if only the contemporaneous and three lagged first dif- 
ferences in income were included in the unrestricted model—the 
system would be overidentified. The particular specification of the 
alternative hypothesis was deliberately chosen in order to achieve just 
identification. Because the model is just identified, the maximum- 
likelihood estimates of the structural parameters can be obtained by 
estimating the reduced form: 


Se i pen ot Payeo Ft PsVi-g TP Vy; (62) 
and 


Moe= Ape + Bol) + Bol(p1 — 1)yy-y + P2yr-2 +... + Psyyg) 
+ Bi Ay, + Body. +... + BrAy_7 + vy, 


where vy = € and vy, = (kB + Boley + Ey. Using the notation 7; to 
denote the reduced-form coefficient of y,_;, 


(63) 


Ac, = fi, + Miia Wopor We se Tera + Vo. (64) 


Equation (63) makes very explicit the relationship between the 
reduced-form coefficients 7; and the underlying structural parame- 


ters: 


4 Eq. (51) suggests a simple instrumental variables estimator of the excess sensitivity 
parameter B. Noting that (p — 1) var y, = cov (Ay, y-1), Eq. (51) can be rewritten as 


cov (Ac;, y,-1) = B cov (Ayy, Ye-1)- (51’) 


Thus 8 is identified as the ratio [cov (Ac, y--1)]/Lcov (Ay, y.-1)]. Writing the consumption 
equation with the income innovation in the error term, Ac; = pt. + By, s (ADe,, + Ex); 
one could estimate 8 by using y,, as an instrument for Ay,. Lagged income, y,_,, is 
clearly a valid instrument for Ay,, since it is correlated with Ay, but uncorrelated with the 
composite error term, (kBe,, + €,). Using y,_, as an instrument for Ay, the instrumental 
variables estimate of B is [cov (Ac, y,-1)]/[cov (Ay, y¢-1)], which is exactly the expression 
for B obtained from eq. (51). 
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fy = Me + Bobi 
7, = BoP: — 1) + Bi 
2 = BoP2 — Bi + Bo 
73 = Bobs — Bot Bs 
(65) 


77 = Bolz — Be + Bz 
T= Bops — Bz. 


Regardless of the time-series process describing income, the restric- 
tion that consumption exhibits no excess sensitivity to current income, 
Bo = Bi =... = Bz = 9, implies that the reduced-form coefficients 77; 
are all equal to zero. 

The reduced form expresses the conditional expectation of the 
endogenous variable, Ac;, given the predetermined variables, which 
in this case are the lagged values of income. Thus, the empirical 
content of the restriction that By = 8; = ... = B; = 0 is that the 
conditional expectation of Ac; given lagged values of income is zero. 
This implication of the permanent income-rational expectations hy- 
pothesis is precisely the hypothesis tested by Hall. Quoting from 
Hall’s paper (Hall 1978, p. 976), “The tests of the stochastic implica- 
tions of the life cycle-permanent income hypothesis carried out in 
this paper all have the form of estimating a conditional expectation, 
E(e | Cr-1, X¢-1), Where x;_, is a vector of data known in period t — 1, and 
then testing the hypothesis that the conditional expectation is actually 
not a function of x;_,.” 

The analytical approach taken by this paper was to formulate a 
general structural model in which consumption responded to both 
innovations in permanent income and changes in current income. 
The restrictions imposed by the permanent income-rational expecta- 
tions hypothesis on the structural parameters, 8B; = 0, were shown to 
imply the restriction that the reduced-form parameters 7; were equal 
to zero. Hall’s approach to formulating a test of the hypothesis 
avoided the econometric specification of a structural consumption 
equation. Eschewing structural estimation altogether, Hall analyzed 
the restrictions imposed by the permanent income-rational expecta- 
tions hypothesis on the conditional expectation of Ac, given lagged 
variables. Thus Hall arrived at the restrictions imposed on the 
reduced-form consumption equation by an analytical shortcut; rather 
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than derive the reduced-form restrictions from an explicit structural 
econometric model, he exploited the interpretation of a reduced form 
as stating a conditional expectation and considered the implications of 
the permanent income-rational expectations hypothesis for the con- 
ditional expectation of consumption. 

Although both Hall’s analysis and the structural model developed 
in this paper lead to the same restrictions on the reduced form of the 
consumption equation, the test procedure used here differs from 
Hall’s in that the restriction is tested by computing the likelihood ratio 
statistic for the two-equation reduced form (eqq. [62] and [63]), while 
Hall’s test statistic is computed by estimating the consumption equa- 
tion (eq. [64]) alone. As proved in Appendix IT, however, the two tests 
yield numerically identical values of the likelihood ratio statistic for 
the hypothesis that the conditional expectation of consumption given 
lagged income is zero.> 

The basic advantage of the structurally derived test over Hall’s test 
is that it permits the recovery of the point estimates of the structural 
parameters measuring the excess sensitivity of consumption to cur- 
rent income, the £;, and their individual standard errors. With Hall’s 
test it is difficult to interpret the quantitative importance of a depar- 
ture from the predicted behavior of consumption since only the 
reduced-form coefficients are estimated. 


Data 


The test of the restriction B; = 0 is a test of the joint hypothesis that (a) 
consumers exploit the time-series properties of income in forming 
their estimates of permanent income and (6) the adjustment of con- 
sumption to a revision in permanent income is fully achieved within 
the quarter that the new information becomes available. In order to 
limit the consumption concept to a component which can be adjusted 
rapidly and smoothly to changes in permanent income, the consump- 
tion concept used as the dependent variable was expenditures on 
nondurable goods. Even if one went to the trouble of converting the 
National Income and Product Accounts (NIPA) data on expenditures 
on durable goods into a series of imputed service flows, the existence 
of substantial transaction costs of adjusting stocks of durable goods 
suggests that the consumption of services of durable goods would 
exhibit lagged adjustment to changes in permanent income. The 


° I am indebted to Hall for suggesting that I estimate the model by indirect least 
squares instead of FIML. The observation that the test based on the two-equation 
reduced form and the test based on the reduced form of the consumption equation 
alone will yield numerically identical values of the likelihood ratio statistic is also due to 
Hall. 
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NIPA data on consumption of services are subject to a similar objec- 
tion, since services are defined to include the imputed service flow 
from housing. Nondurable goods represent about 45 percent of total 
personal consumption expenditures. 

Both the consumption series and the income series, which was 
disposable personal income, were seasonally adjusted and expressed 
in real per capita terms. After allowing for the construction of the 
lagged variables, the sample period used in the estimation was 
1949:111-1979:1. 

The income process was modeled as an eighth-order autoregressive 
process around an exponential trend. In order to simplify the estima- 
tion, the income data series was first detrended by its estimated 
exponential trend for the sample period 1947:1-1979:1 of .00565259 
per quarter, and the model was estimated with the detrended series. 
As mentioned above, the presence of a trend in per capita income will 
give rise to a trend in per capita consumption. Because the consump- 
tion variable is expenditures on nondurable goods rather than total 
consumption, the trend in nondurable goods consumption will not be 
equal to the trend in income unless the income elasticity of nondura- 
bles consumption is unity. For this reason, the consumption series was 
detrended by its own estimated trend, which was .0032616 per quar- 
ter. Both the income and consumption series were expressed in units 
of (constant) dollars per capita. 


Empirical Results 


The reduced form was written out in terms of the structural param- 
eters and estimated by multivariate least squares. The unconstrained 
system estimated was: 


Ye = Br + pPiye-1 + Poyt-o +... + Polis + Vit (66) 
and 
Ac; = flo + Bol(p1 — Lye + P2di-2 + - - - + psyi-s] + BiAye-1 (67) 
Pa oY zoe BrAyi_-7 + Vee- 


The point estimates and their standard errors are reported in table 
2A. 

Imposing the restriction By = B, =... = B; = 0, the constrained 
system consisting of the income equation (66) and the consumption 
equation 


Ac, = Pe + Vee (68) 


was also estimated by multivariate least squares. The point estimates 
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TABLE 2 


A. Point EsTIMATES AND STANDARD ERRORS FOR Equations (66) AND (67) 

















My pi Po Ps Ps Ps Ps pa Ps 
92.54 964 .069 074 —.360 112 157 -.008 —.053 
(64,26). 94-092) —-C126)— C125) © G122)5— (123) = (124) C122)... 085) 
He Bo Bi Bz Bs Ba Bs Bs Br 

32.71 355 071 .049 —.116 114 -.073 050 ~.012 


(24.95) (.275)  (.036)  (.038) (.050) (.071) (.046) (.036) (.035) 





B. Point EsTIMATES AND STANDARD ERRORS FOR EQUATION (68) 























Mi PP P2 P3 P4 Ps Pe Pz Ps He 


55.62 SOOO OO a ee 2d Le 247/16 200045 06Gb Ry —2045 .068 
(S/H) COS) wn (cli dicd) eel! 12) (11 O) atl 01) a (eal 111) rl ON 107.6) (.862) 











C. REDUCED-FORM COEFFICIENTS AND STANDARD ERRORS FOR EQUATION (69) 





He Ty 2 13 14 Ts 76 i Cur) 








Baia .058 O02 138 03 ela ol = {Nas} —.007 
(25.95) (2037) 9 4GO51) (.050) (.049) (.050) (.050) (.049) (.034) 





and standard errors, based on the constrained estimation, are re- 
ported in table 2B. 

Using the notation L, and L, to denote the log likelihood of the 
constrained and unconstrained systems, respectively, the likelihood 
ratio statistic A = —2(L, — L,) for the hypothesis By = B; = ... = By = 0 
is distributed x?(8). The value of the likelihood ratio test statistic, \, 
was 27.024. Since the critical value of the x?(8) distribution is 21.96 
for the 0.5 percent significance level, a test statistic of 27.0 represents 
a very decisive rejection of the hypothesis. 

For comparison, Hall’s version of the test using the reduced form of 
the consumption equation alone, 


Ac; — be aS 71 Yt-1 a T2t-2 isere tts T3Vt—8 =r Vit, (69) 


was also estimated. The reduced-form coefficients and their standard 
errors are reported in table 2C.° The likelihood ratio statistic for the 
hypothesis that By) = 8; = ... = B; = 0, itis necessary to compute the 


computer work confirms the assertion that the test of By = B, =... = 


6 The estimated constant term in the constrained version of eq. (69) was .068, with a 
standard error of .866. 
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TABERSS 


STANDARD ERRORS OF REGRESSION AND DuRBIN-WATSON STATISTICS 








SYSTEM ESTIMATED 














‘Two-Equation Reduced Form Single-Equation Reduced Form 
Unrestricted Restricted Unrestricted Restricted 
y Ac y Ac Ac Ac 
SEROGe. te $22.50 $8.73 $23.04 $9.45 $8.73 $9.45 
DEWiea as 2.00 Zale 1.90 2e3 2.15 23 





Notr.—y and Ac = dependent variables. 


8; = 0 in the context of the two-equation reduced form (eqq. [66] 
and [67]) and the test of 7, = 7. =... = ws = 0 in the context of 
the reduced-form consumption equation alone, equation (69), yield 
numerically identical values of the likelihood ratio statistic. 

The standard errors of regression (SER) and Durbin- Watson statis- 
tics are reported in table 3. The SER of each equation was computed 
by dividing the sum of squared residuals by the degrees of freedom 
rather than the number of observations. The standard errors have the 
interpretation of the dollars per capita prediction error of each equa- 
tion. For comparison, the standard deviations of y and Ac were 
$71.57 and $9.45, respectively. 

The discussion of the identification of the system showed that the 
maximum-likelihood estimates of the B; parameters were inde- 
pendent of the value of k®, the parameter which measures the pre- 
dicted response of consumption to an innovation in current income. 
While the identification of the parameter k® is not required to test the 
hypothesis that 8) = 6, =... = 8, = 0, itis necessary to compute the 
value of k® in order to recover the parameters of the covariance 
matrix of structural disturbances. 

By substituting the estimated parameters of the income process into 
equation (34), one can compute the value of ®, which gives the 
capitalized value of the revision in expected future income or the 
change in wealth associated with an innovation in current income. For 
the estimated parameters of the income process, and an annual inter- 
est rate of 5 percent, an innovation in current income implies a 
change in wealth of 17.80 times the income innovation.7 Assuming an 


In order to simplify the estimation, the income process was modeled as an 8-quarter 
pure AR process. Formal tests of more general mixed ARMA models of the income 
process identified the (3, 0, 4) specification (three AR parameters and four MA 
parameters) and the (1, 0, 4) specification as the most parsimonious mixed ARMA 
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average horizon of 20 years, the annuity rate is 0.0198 per quarter.’ 
The change in permanent income associated with an innovation in 
current income is (0.0198)(17.80), or 0.35 times the income innova- 
tion. Since the consumption variable used to estimate the system was 
nondurables consumption rather than total consumption, the 
parameter k® will be the product of the marginal propensity to 
consume nondurables out of permanent income and the change in 
permanent income induced by an innovation in current income. With 
nondurable consumption accounting for 47.5 percent of total con- 
sumption, k® = (0.475)(0.0198)(17.80) = 0.1675. 

The estimated covariance matrix of the disturbances of the unre- 
stricted reduced form is: 


— [467.808 Urea 
TO SoZ FOOL iol 


Having identified the value of k® as 0.1675 and the value of B, as 
.355, the covariance matrix of the structural disturbances can be 
recovered using equation (57). The estimated covariance matrix of 
the structural disturbances is: 


£=| 467.808 wsakshPee| 
— 164.538 114.745] 7 


Note that the covariance between the structural disturbances of the 
two equations, cov (€;, €), is negative and very large relative to the 
variance of the disturbance in the consumption equation, €,. In 
the discussion of the properties of the structural disturbances, it was 
pointed out that the omission of elements of the available information 
set from the income-forecasting equation implies that the income 
residual is subject to measurement error and that the presence of 
measurement error gives rise to a negative covariance between the 
structural disturbances. The large negative estimate of cov (€,, €) 
suggests that the measurement error introduced by using the 
income-forecast error as a measure of the “true” income innovation 
cannot be dismissed as negligible. ‘The finding that the measurement 
error is quantitatively important indicates that attempts to estimate k® 


representations of the income process. The value of ® implied by the (3, 0, 4) model 
was 16.41; by the (1, 0, 4) model, 15.45. Since the ARMA models identified through 
formal time-series analysis of the income series yield values of ® very similar to the 
value of ® computed from the 8-quarter AR process, there is no indication that the 
assumed pure AR process is inadequate in capturing the time-series properties of 
income. 

8 The quarterly annuity rate was calculated with the following formula: annuity rate 
= r/(1 — e~"7). The quarterly interest rate, r, was assumed to be 0.0125; the horizon of 
the representative individual, 7, was assumed to be 80 quarters. 
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directly by doing regressions of the change in consumption on the 
income residuals will be subject to severe downward bias. 

The discrepancy between the predicted and observed behavior of 
consumption is quantitatively large as well as statistically significant. 
Since the dependent variable is consumption of nondurable goods 
rather than total consumption, the A; coefficients represent the excess 
sensitivity of nondurables consumption alone. Considering that non- 
durable goods consumption represents 45 percent of total consump- 
tion expenditures, the point estimates of Bo, B,, and B, of .355, .071, 
and .049, respectively, indicate a strong “excess” response of con- 
sumption to current income. Unfortunately, the estimate of the 
parameter By is imprecise, as indicated by its fairly large standard 
EIlLOL.Oln2 10; 

Because of the large sampling error of the estimate of the crucial 
parameter Bo, these tests stop short of providing conclusive evidence 
that the permanent income-rational expectations hypothesis fails in a 
quantitatively significant way. Stull, the large point estimate Bo, to- 
gether with the statistical rejection at the 0.5 percent level of the zero 
restrictions on the 6; coefficients, constitute significant, if not conclu- 
sive, evidence against the hypothesis. 

In Hall’s paper, tests of the predictive value of lagged income did 
not reveal strong evidence against the permanent-income hypothesis.® 
Hall concluded, “The results of this paper have the strong implication 
that beyond the next few quarters consumption should be treated as 
an exogenous variable. . . . With respect to the analysis of stabilization 
policy, the findings of this paper [support] the view that policy affects 
consumption only as much as it affects permanent income” (p. 986). 

Why is it that the tests reported here reveal substantial evidence 
against the permanent-income hypothesis, whereas Hall’s did not? As 
shown above, the test based on the two-equation reduced form is not 
intrinsically more powerful than Hall’s test. The structurally derived 
reduced-form test provides stronger results for two reasons, one 
obvious and the other more subtle. First, by permitting the recovery 
of the structural parameter estimates measuring the excess sensitivity 
of consumption to current income, the B;, the structurally derived 
reduced-form test makes it possible to assess the quantitative 
significance of a departure from the null hypothesis. 

Of the regressions reported by Hall, the regression of current 
consumption (of nondurable goods and services) on lagged con- 
sumption and four lagged values of income is closest to the reduced 


* Hall was able to statistically reject the permanent-income hypothesis based on tests 
of the predictive value of a stock price index. 
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form estimated in this paper. On page 983 Hall reports the following 
coefficient estimates and standard errors. 


( = 2a L.076¢._5 + 049y,_, a 05 194-2 3 023y1_3 =¥ 02494. 
(ident 04a)agn043)04 1 (-052)h 6 (051), 037) 60) 


Testing the hypothesis that the coefficients of all four lagged values of 
income were zero, Hall got an F-statistic of 2.0, which was high 
enough to reject the hypothesis at the 10 percent level, although not 
the 5 percent level. Noting that the first lagged value of income had a 
small positive coefficient and that the sum of the coefficients was small 
and negative, Hall concluded, “There is a statistically marginal and 
numerically small relation between consumption and very recent 
levels of disposable income” (p. 984). 

The statement that the relationship between consumption and dis- 
posable income is “numerically small” refers to the magnitude of the 
reduced-form coefficients, the 7;, not the structural parameters B;. 
Hall is explicit on the point that his regressions did not provide 
estimates of structural parameters: “No claim is made that the true 
structural relation between consumption and its determinants is re- 
vealed by this approach” (p. 977). 

The obvious advantage of the structurally derived reduced-form 
test is that it permits the estimation of the structural parameters. In 
equation system (54), the reduced-form coefficients are written out in 
terms of the underlying structural parameters. Equation system (68) 
indicates that even if consumption responds strongly to current and 
lagged changes in income, that is, the B; are large and positive, the 
reduced-form parameters will be small. Consider in particular the 
reduced-form coefficient of y,-;: 


7 = Bo(p: — 1) + By. (71) 


When nondurables consumption alone is used to form the dependent 
variable, the estimate of By) is .355 and the estimate of B, is .071. 
However, since the first autoregressive parameter in the income pro- 
CESS, P;, 1s estimated to be .964, the value of zr, is only .058. When the 
consumption concept is defined as nondurable goods and services, Bo 
and £, are estimated to be .461 and .128, respectively. The resulting 
reduced-form coefficient 7, in this case is .112. In addition to the 
4-quarter distributed lag, Hall reported the results for a test using a 
12-quarter Almon lag. In both cases the sum of the reduced-form 
parameters was small and negative. Using equation system (65), note 


that : 
Dim = Bo (DP 1) (72) 
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The sum of the autoregressive parameters of the income process is 
about .95 for quarterly data. Therefore, even if By) has a healthy 
positive value, the sum of the reduced-form parameters will be small 
and negative. 

While the test based on the reduced form of the consumption 
equation alone gives the same test statistic as the test based on the 
two-equation reduced form, it does not by itself provide any informa- 
tion about the quantitative importance of a departure from the null 
hypothesis. Observing that the reduced-form coefficients are small 
does not imply that the structural coefficients are small. To the con- 
trary, large positive values of the 6; coefficients are associated with 
reduced-form parameters which are small and which sum to a nega- 
tive value. 

The second, more subtle advantage of the structural approach is as 
follows: Because it requires the explicit specification of a structural 
model, the structural approach provides some guidance concerning 
the specification of the reduced form. 

While Hall’s approach of analyzing the restrictions on the condi- 
tional expectation of consumption led to the correct conclusion that 
the reduced-form coefficients of lagged variables should be zero, it 
did not suggest any criteria for selecting a particular collection of 
variables to be included in the reduced form. By studying the 
structural specification, one learns that, under the alternative hypoth- 
esis (By > 0), any lagged value of income which has a nonzero 
coefficient in the income autoregression will in general have a non- 
zero coefficient in the consumption reduced form. 

Further, if one believes that the income process is best modeled as 
an AR process around an exponential trend, then the consumption 
reduced form should be fit to the detrended income series rather than 
the raw income series. Alternatively, one could leave the income series 
in its original (not detrended) form and include an exponential trend 
term, e”’, in the consumption reduced form. In Hall’s original tests, 
the income series was not detrended and the reduced form did not 
include a trend term. If the income process does include a trend, 
Hall’s tests are misspecified under the alternative hypothesis. One 
would expect that the failure to allow for the trend in income, which 
causes the reduced form to be misspecified under the alternative 
hypothesis, would reduce the power of the test. As an empirical 
matter, the treatment of the income trend does have a noticeable 
effect on the value of the likelihood-ratio statistic. In all of the tests 
reported above, the income series have been detrended. 


© In a comment on Hall's paper, Startz (1979) pointed out that the inclusion of an 
exponential trend in the regression tended to increase the value of the test Statistic. 
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Having shown that the test based on the bivariate reduced form and 
Hall’s test based on the single-equation reduced form yield numer- 
ically identical values of the likelihood ratio statistic for comparable 
specifications, it is necessary to determine why the rational 
expectations—permanent income hypothesis is rejected at the 0.5 per- 
cent level in the tests reported here, while the hypothesis could not be 
rejected at the 5 percent level in Hall’s original tests. In particular, one 
would like to know whether the difference in the test statistic is 
attributable to differences in the data sets, or to differences in the 
specification of the alternative hypothesis, in terms of the number of 
lagged values of income included and the treatment of the trend in 
income. 

Hall used a broader concept of consumption—expenditures on 
nondurable goods and services—than the consumption concept, 
nondurable goods, used in this paper. Data on real per capita con- 
sumption of nondurables and services were constructed to provide a 
direct comparison to Hall’s original results. While the data set used to 
rerun Hall’s regressions was constructed according to his basic defini- 
tions of the variables, it is not an exact reproduction of his data set. 
The data set used here differs from Hall’s because it incorporates 
subsequent NIPA revisions of the most recent observations and be- 
cause a different price deflator was used to deflate the income series. 
In the regression of current consumption on lagged consumption and 
four lagged values of income, Hall obtained an F-statistic for the 
exclusion of the lagged values of income of 2.0, which is somewhat 
below the critical value of 2.45 for the 5 percent level. For the same 
sample period and specification, the test statistic was 2.75 using the 
reconstructed data set. Thus, minor differences in the construction of 
the data raise the significance level of the test statistic from somewhat 
below to somewhat above the 5 percent level. Consideration of the 
structural model indicates that the reduced form should include as 
many lagged values of income as the income autoregression and that 
some allowance for a trend in income should be made, either by 
detrending the income series or by including a trend term in the 
consumption reduced form. Using the original (not detrended) in- 
come series but increasing the number of lagged values of income 
included in the consumption reduced form from four to eight in- 
creased the value of the likelihood-ratio statistic from 10.95 to 22.616. 
This raises the significance level of the test from the 5 percent level to 
the 0.5 percent level. Detrending the income series also has a substan- 
tial effect on the test statistic. For the specification with four lagged 
values of income, the test statistic was 16.492 for the detrended 
income series (which is significant at the 0.5 percent level) as opposed 
to 10.95 for the raw income series. For the specification with eight 


1006 JOURNAL OF POLITICAL ECONOMY 


lagged values of income, detrending the income series raises the test 
statistic from 22.616 to 27.108. Thus, the insight concerning the 
appropriate specification of the reduced form which is obtained by 
explicitly formulating the underlying structural model was responsi- 
ble for raising the significance level of the test from a marginal 
rejection to a decisive rejection. 


IV. Conclusions 


This paper develops a simple structural econometric model of con- 
sumption. In the general specification of the model, consumption 
responds to the changes in permanent income signaled by innovations 
in the current income process and to changes in current income itself. 
The response of consumption to current income beyond that at- 
tributable to the role of current income in signaling changes in per- 
manent income is termed the “excess sensitivity” of consumption to 
current income. The econometric model is then used to estimate the 
excess sensitivity of consumption to current income and to test the 
implication of the permanent-income hypothesis that the excess sen- 
sitivity of consumption is zero. 

The tests reveal substantial evidence against the permanent-income 
hypothesis. Using either nondurables consumption or consumption 
of nondurables and services as the dependent variable, the hypothesis 
that consumption exhibits no excess sensitivity to current income can 
be rejected at the 0.5 percent level. Using expenditures on nondura- 
ble goods as the consumption variable, the point estimate of the excess 
sensitivity of consumption to the contemporaneous change in income 
is .355. Keeping in mind that nondurable goods represent only a 
fraction of total personal consumption expenditures—about 45 
percent—a point estimate of .355 represents a large departure from 
the permanent-income hypothesis. 

The paper also shows that the test of the permanent income-— 
rational expectations hypothesis proposed by Hall (1978) can be 
thought of as a test based on the reduced form of the structural model 
developed here. Rather than develop a structural econometric model 
and derive the associated restrictions on the reduced-form parame- 
ters, Hall’s approach was to analyze the implications of the permanent 
income-rational expectations hypothesis for the conditional expecta- 
tion of consumption given lagged variables. Since a reduced form has 
the interpretation of expressing a conditional expectation, Hall’s 
analysis led directly to testable restrictions on the parameters of a 
reduced-form consumption equation. When the structural model is 
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just identified, both tests yield numerically identical values of the test 
statistic. While the structural approach does not provide a statistically 
more powerful test, in general, than the reduced-form approach, it 
does have the advantage of providing estimates of the structural 
parameters of the model, including estimates of the “excess sensitiv- 
ity” of consumption to current income. 


Appendix I 


For the general model 


Jt = PrYe-1 T P2yt-2 +... + Ppye—p + €¢ + Pree + hore +... + boe-,. (Al) 
. dyr+5 - 
== by (A2) 
> dE a 


where the y; are the infinite series of moving average parameters in the 
pure moving average representation of the model: Me = & + Wey + Woe» 


+ .... Solving for w; in terms of the estimated p; and qj, 
Yo = bo = 1 (A3) 
dy, = Oy; a OY, dy,-; fori — il, co (A4) 
de; OE; =i OY; de; 
Wi= i+ > pas (A5) 
Fl 


where p; = 0 for) > p, d = 0 fori > gq, 


Bs Wi = bE pi + iS a PiWi-j (A6) 


i i=1 j=1 


adding by = bo, (A7) 


Si wi = ss: pi + > PiWi-j (A8) 
=0 i=0 1 j=l 


Y= dot Ved bes (A9) 
Sop: Wi (A10) 


i=0 i=0 j=1 i=0 
(1- So) m= d 4 (All) 
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Appendix II 

Proof that the likelihood ratio statistic for the test of y = 0 in the equation 
Ace = w+ Yye-1 + Mt (A13) 


is numerically identical to the likelihood ratio statistic for the test of 8 = 0 in 
the bivariate reduced form: 


(Tao, lee 2) (2% 2) (T X 2) 
Ly, Ac =(1 yealfor Me ie [vie Vee). (Al4) 
é B(p — 1) 


The GLS transformation of equation (A14) is obtained by postmultiplying by 
the lower triangular matrix P such that PP’ = (1, where { is the covariance 
matrix of the reduced form: 


ne 
Pe : 
Ee fa 


It will be convenient to normalize P such that the lower-right-hand element is 
equal to one. Define 
ae P 1 1 


Piel 
where P, =(P;,/Pa) and Py = (P.5/P.5)- Transforming (A14) by P yields: 
Pia + Pages aa (P yp, as P softs) + yi Pip = P 2B(p =) Pye (APD) 
and 


Ac, = Me + ¥-1B(p — 1) + Voz (A16) 


where 0,, = Py, + Pyvoy. 

Let L' denote the log likelihood of the transformed income equation (A15), 
L? denote the log likelihood of the consumption equation (A16), and L!? 
denote the log likelihood of the bivariate system. Since cov (¥4;, V2;) = 0, the 
value of the log likelihood function of the bivariate system is the sum of the 
values of the log likelihood of the two individual equations: L!? = L! + L?. 
Imposing the constraint that 8 = 0 reduces the value of L?, L? < L?. Imposing 
the restriction that B = 0 does not restrict the log likelihood function of the 
transformed income equation, since 4, and p are unrestricted parameters, L? 
= Ly. Therefore, all of the decrease in the likelihood function of the bivariate 
system is attributable to the decrease in the likelihood function of the con- 
sumption equation alone: Li? — Li? = L2 — L?. 
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Measurement of Monopoly Behavior: 
An Application to the Cigarette Industry 





Daniel A. Sumner 


North Carolina State University 


A simple scheme is proposed to measure monopoly pricing behavior. 
The coefficient of the tax rate term in a price equation identifies the 
ratio of price to marginal cost. No direct measurement of costs is 
required, so a major problem for other empirical studies of monopoly 
is avoided. Empirical results for cross-section time-series data sup- 
port rejection of atomistic competition but also provide evidence 
against the operation of an effective cartel in the cigarette industry. 
The model represents an alternative interpretation of related results 
in a recent paper by Barzel. Application of the methodology to other 
markets is feasible. 


I. Introduction 


The cigarette industry is often used as an example of a less than 
perfectly competitive industry. For example, in the famous Tobacco 
Case of 1946, the major cigarette manufacturers were accused of 
operating an illegal cartel; though they were convicted, the consensus 
was that company behavior was not altered by the verdict.! This paper 


This is a revised version of “Interstate Cigarette Price Differentials and the Mea- 
surement of Firm Level Demand Elasticities” (October 1979). I wish to thank Yoram 
Barzel, Robert Clark, James Easley, David Flath, Charles Knoeber, Michael Kusnic, 
Thomas MaCurdy, Michael McElroy, Ronald Schrimper, Michael Wohlgenant, and an 
unknown referee for their comments. This paper is no. 6626 of the Journal Series of 
the North Carolina Agricultural Research Service, Raleigh. 


* See Nicholls (1949), Bain (1968), or Tennant (1971) for discussion of the 1946 case 
and of the “structure, conduct and performance” of the cigarette industry. Stigler 
(1966, p. 244) presents a table of industries with high concentration; cigarettes rank 
fourth in the share of assets held by the “giant firms.” Friedman (1953, p. 37) also 
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attempts to investigate the alleged market power of cigarette man- 
ufacturers with a simple scheme for measuring the weighted-average 
price elasticity of demand facing firms in this industry. 

The empirical examination of the pricing behavior of the cigarette 
industry finds an absence of evidence of an effective cartel, but the 
hypothesis of atomistic competition is also rejected. The approach 
uses data on excise taxes, but it does not depend on measuring the 
cost of production. Since reliable cost data are not generally available, 
this paper avoids an important weakness of directly measuring 
monopoly power by the ratio of product price to marginal cost. The 
model suggested here is also applicable to other industries with varia- 
tion in excise taxes or some other directly measurable components of 
marginal cost.? 


II. A Simple Model 


The equilibrium output condition, that marginal revenue equals 
marginal cost, implies that 


Pa — Iai (ns a 1)]M,;, (1) 


where P;; is the product price, M,; is the marginal cost of production, 
and 7 ; 1s the price elasticity of the demand curve facing firm j. If the 
industry is purely monopolistic, n; is equal to the market elasticity 7”; 
while if it is perfectly competitive, n ; = —% and is independent of n”. 
Thus 7; is a measure of the degree to which the firm is not strictly a 
price taker.? However, since good measurements of M , are difficult to 
find, equation (1) does not offer a practical solution to the problem of 
determining potential monopoly power. 

For a product like cigarettes, which has a per unit excise tax, the 
marginal cost term may be expanded as 


M,;=X,B+€;+R, (2) 


discusses potential monopoly power of cigarette firms, predicting that in an analysis of 
response to tax rate changes, “broadly correct results will be obtained by treating 
cigarette firms as if they were producing an identical product and were in perfect 
competition.” 

? The empirical model derived in this paper is similar to that used for a different 
purpose by Barzel (1976) (and also in a comment by Johnson [1978]). The present 
paper was weil under way when the Barzel results were pointed out to me by Charles 
Knoeber. A paper by Manchester (1976) also presents (in an appendix) a regression 
related to those estimated below. However, Manchester does not interpret his tax rate 
coefficients in that paper. 

3 For multiproduct firms, optimal pricing decisions require attention to cross elas- 
ticities of demand. I will simply ignore this refinement and implicitly assume that it is 
not empirically important. In what follows I use the term “firm” to mean the relevant 
pricing decision maker for the given product. 
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where X;8 represents the set of observed factors determining mar- 
ginal costs, €; represents unobserved factors, and R is the tax 
rate, which does not vary by firm or brand. Equations (1) and (2) 
may be combined to show that even without direct measurement of 
M,;, the firm-level elasticity of demand may be estimated as a func- 
tion of the coefficient of the tax rate variable. 

Equations (1) and (2) apply to pricing decisions made by firms. 
However, since firm-level data on prices are generally not available, 
some discussion of aggregation is in order. The observations are on 
the average price of cigarettes, defined as 


eae 
j 


where J is the number of firms and S; is the market share of the jth 
firm. Analogously define 
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where 0; = n;/(n; + 1). From these definitions it follows that 6 = n/(n 
 |)rand 


J 
P= 96M + > S,(6; — 6)(M; — M). (3) 
J 
This result shows the relationship between aggregate prices and costs 
that follow from the firm-level theory. 

From equation (3) there are three conditions under which the 
simple aggregate relationship, P = 6M, approximately holds: (1) The 
variation in elasticity of demand across firms is small, that is, (@; — 0) is 
small on average; (2) the variation in marginal costs across firms is 
small, that is, (M; — M) is small on average; and (3) there is little linear 
relationship between the elasticity of demand and marginal costs 
across firms. The term after the plus sign in equation (3) is simply an 
estimate of the covariance between 6; and M ;- In the empirical model 
this term is assumed to be negligible, so P = 9M will be fit to aggregate 
data. This is supported by the facts that retail cigarette prices vary 
little across brands within any market and that costs of production 
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differences across brands are likely to be small since cigarettes are 
produced in a single region of the country with a standard technol- 
ogy. Finally, since firm- or brand-level data are not available, no 
adjustment for the potential covariance is empirically possible. 

A linear price equation for cross-section time-series data may be 
written as 


Pe= Xi + OR yor Ui, (4) 


where 7 refers to states, t refers to years, and U;, is a random distur- 
bance term. This says that as the tax per pack varies across states and 
years, the price varies by a proportion equal to or greater than 1.0. 
For example, if the tax in Washington is 4 cents per pack higher than 
the tax in Oregon then, ceteris paribus, if there is some monopoly 
power, the price in Washington will be more than 4 cents higher.* 

The model just presented notes that monopoly power would give 
cigarette manufacturers an incentive to charge different prices in 
states and years with different excise tax rates.> However, the obser- 
vation of this behavior is limited by the extent of interstate (and 
interyear) arbitrage. A cigarette merchant in a high-tax state who is 
charged a higher price by the manufacturer (because 6 > 1) would 
want to buy in a low-tax state, transport the cigarettes back to his own 
state, and thus, by competition, eliminate price differentials. 

In fact, the legal institutions regulating the distribution of cigarettes 
place strong restrictions on the potential for arbitrage. The federal 
tax is paid by manufacturers on all cigarettes sold in the domestic 
market with the tax stamp affixed at the factory and so is unrelated to 
across-state arbitrage. Distributors buying these cigarettes from the 
manufacturers are required to have state licenses in the state where 
they operate; they must pay the state tax and affix the state tax stamp 


4 For eq. (4) to be the pricing equation, the demand elasticity, 7, must be a constant 
with respect to P so that and 6 are constant parameters. If the underlying firm-level 
demand curve does not have a constant elasticity, the price equation is nonlinear. 
Nonlinear variants of (4) are developed below by allowing 7 to be an explicit function of 
price. The methodology of this paper is applicable under any functional form for the 
firm-level demand curve. There is also variation across states and years in sales tax rates 
applied to cigarettes. However, if the ratio of the sales tax rate to |» | is small, the effect 
of sales taxes on (4) is negligible. Since the sales tax rates range from 0 to 0.06 and || 
is much larger (well above 1), I have ignored this complication in the sequel. 

5 Wholesale prices for cigarettes are published by the U.S. Department of Agriculture 
in its “Tobacco Situation” reports. These figures show that all brands and companies 
charge the same price. (There is a small price difference between regular cigarettes and 
long cigarettes.) Furthermore, these prices have remained the same over several years 
and move up together in periodic discrete jumps. (The jumps have been more frequent 
recently, given the inflation of the last few years.) However, these published prices are 
of little interest here. They do not necessarily reflect the price at which cigarettes are 
actually sold—only the “list” price. Thus discounts, promotional aids, and special 
consideration are not included. For the purposes of this paper, only actual transacted 
wholesale prices would be useful. 
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within a few days of receiving a shipment. In some states a distributor 
may receive shipments to be sold and taxed in another state, but that 
typically requires a special permit from the state government which 
gives information on the quantity of the shipment and its final desti- 
nation. Since distributors must be licensed to pay the state tax and 
hence to sell cigarettes legally within a state, manufacturers have avail- 
able (from public records) a list of all wholesalers of their products for 
each state. Furthermore, since to resell cigarettes out of state takes a 
special permit, the manufacturer can easily know the final state of sale 
for cigarettes which they ship. Thus it would be very difficult for 
cigarette distributors to arbitrage legally across state lines without the 
knowledge of manufacturers. This allows the manufacturers to dis- 
criminate on the basis of tax rate in the state of final sale. Further- 
more, profitable legal interstate arbitrage depends on price differen- 
tials generated by the monopoly power in the market and on the cost 
of shipping and handling. There is room for a differential of several 
cents per pack before distributors would be able to avoid manufactur- 
ers’ sanctions profitably or engage in illegal movements. Thus for 
implied differentials in the range found below, the estimates should 
be relatively free of the influence of potential arbitrage. This issue 
might warrant further analysis if large differentials were generated by 
monopoly power.® 

Monopoly on the part of wholesalers or retailers could also 
influence the relationship between P;, and R;, and hence could bias the 
interpretation of 7 as revealing monopoly power of the manufactur- 
ing firms. My assumption of a high degree of competition at these two 
levels is supported by the large numbers of retailers in each market 
area as well as significant numbers of distributors and chain-store 
organizations at the wholesale level. (For data on the retail and 
wholesale tobacco distribution industry and a summary of laws, see 
the National Association of Tobacco Distributors’ Coordinator [1979].) 


° Two effects, related to illegal arbitrage (“buttlegging”), may generate a relationship 
between tax rates and the average price net of taxes. First, in states with relatively low 
tax rates, a significant portion of cigarettes sold may be sold in large lots for transfer to 
another state. The average transaction cost per pack on these sales will likely be low, 
and therefore the measured average price of cigarettes sold will be low. This yields a 
positive correlation between the tax rate and the price net of taxes. Second, in states 
with relatively high tax rates a significant portion of cigarettes consumed may be bought 
outside legal channels. Competition from the illegal market (which does not pay the 
state tax) will cause a lower average price among the legal dealers. The result is a 
negative correlation between tax rates and average price net of taxes. Putting these 
together implies a concave relationship between tax rates and price. (Very-low-tax and 
very-high-tax states should lie below the regression line.) Below, I test for the impor- 
tance of smuggling in affecting the relationship between the retail price and the excise 
tax across states. Note that if smuggling were dominant, all tax-related price differen- 
tials would collapse, and the coefficient between price and tax rate would be near zero. 
This has not taken place. 
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II. Empirical Model 


The empirical specification of equation (4) requires a discussion of 
the X;, vector and the disturbance term U;,. The focus of this paper is 
on the effect of excise taxes on pricing; therefore, the purpose of 
specifying Xj, is to avoid bias in estimation of 0. The X;, vector is 
included to account for attributes of states and years that directly 
affect P and are correlated with R. In the estimation, several alterna- 
tive sets of variables were used and compared. The most complete 
specification uses a full set of dummy variables for states and years in 
X,,. Factors which vary from year to year are accounted for by a set of 
DY, variables with one variable for each year in the sample (with one 
excluded to avoid singularity). Factors which vary from state to state 
are accounted for by a set of DS; variables with one variable for each 
state in the sample (with one excluded to avoid singularity). In an- 
other specification, the national consumer price index is used in place 
of the DY, variables. In each case the price variation remaining, which 
is represented by U;,, is assumed to be uncorrelated with Rj. This Ui, 
term either is treated as a single disturbance or is modeled in a 
variance-components framework with separate random effects for 
the time-series and cross-section observations.’ 

In this paper, tax rate variation is treated as exogenous. Research 
explaining the pattern of cigarette taxes would be interesting but is 
not included in this paper.® Note that with the whole set of individual 
state dummy variables included in the price equation, a stable pattern 
of state tax rates is accounted for by these intercept terms. For exam- 
ple, the location of the tobacco and cigarette industries has not 
changed over time, so the effect that their location has on state prices 


7 Nonlinear models related to (4) were also applied. To allow for demand elasticities 
that were not constant, let [n/(n + 1)] = 0(P) = yP?-. Two plausible specifications of 
the error term yield models which may be estimated conveniently by nonlinear least 
squares. These are © 


Pig = (Xiah + yR itl cadary 


and 
Pit = (Xiah + yRit)'? Wit. (4") 


In both cases A = 1 implies that @ = y, and this parameter is a constant. 

® Some basic facts concerning the cigarette excise tax which might guide a theory of 
taxation are as follows: The federal tax rate has remained 8 cents per pack for more 
than 25 years. The median level of state taxes has risen from 3 cents per pack in 1954 to 
12 cents per pack in 1978. This amounts to a fall in combined taxes in real terms and a 
fall from 48.7 percent of retail price in 1954 to 35.5 percent in 1978. In 1978 state taxes 
ranged from 9 cents in North Carolina to 21 cents in Florida, Connecticut, and 
Massachusetts. The three states with a significant stake in production of cigarette 
tobacco and the manufacture of cigarettes (North Carolina, Virginia, and Kentucky) all 
have low tax rates. Variation across the rest of the states cannot be explained by a local 


cigarette-producing industry. 
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is captured by state dummy variables and does not bias the coefficient 
of Fue 


IV. Empirical Results and Interpretations 


The major empirical results of this study are reported in table 1. In 
this section, these results are presented and interpreted. A potential 
alternative interpretation suggested by Barzel (1976) is also consid- 
ered. 

The estimates reported are based on annual tax rates and price data 
for the years 1954—78 for 47 states and the District of Columbia. 
Alaska, Hawaii, and New Hampshire are excluded from the sample 
for the analysis reported in the table because of lack of data and 
peculiarities in their tax laws. (See n. 11 below for more discussion of 
the New Hampshire case.) 

Information on cigarette prices and tax rates is collected and pub- 
lished by the Tobacco Tax Council. The “weighted-average price per 
package” published for each state and year uses national weights for 
type of cigarette (regular, king, 100 mm) and for type of transaction 
(carton, single pack, machine). Wholesalers around the country are 
asked to provide estimates of the average retail price in their area for 
each type of cigarette and type of sale. These numbers are turned into 
state averages by the council. (This information is from corre- 
spondence with the very helpful personnel at the Tobacco Tax Coun- 
cil.) 

Table 1 shows estimates of the linear model of equation (4) under 
three alternative specifications of the X;, vector and the disturbance 
term. The table reports the estimated 6 and, to aid in interpretation, 
the implied price elasticity, n. In row 1, X; consists of a dummy 
variable for each state and for each year. The tax rate effect in this 
row is 1.074, which says, for example, that a 10 cent difference in tax 
rates causes a 10.74 cent increase in price. The coefficient is 
significantly above 1.0 (standard error = 0.013), which is support for 
rejecting the hypothesis of atomistic competition in this industry. 
However, the implied price elasticity is —13.5, which indicates a 
rather flat demand curve. Row 2 replaces year dummies with the 
National Consumer Price Index and drops the state dummy variables, 
while row 3 shows estimates from an application of a variance- 
components model with the price index as the only additional explan- 
atory variable. The estimates of @ for these rows are 1.029 and 1.069, 
and both are significantly above 1.0. Other specifications of X;, and U;, 
yielded estimates of 6 similar to those reported. 

This basic model was also estimated separately for each year using 
states as the unit of observation. The estimates of @ were generally in 
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TABLE 1 


ESTIMATES OF THE EFFECT OF THE TAX RATE ON CIGARETTE PRICE IN ALTERNATIVE 
SPECIFICATIONS OF THE LINEAR Monet (Eq. [4]) 














Implied Price Other Variables 

Estimated 6 Elasticity Included* Method 

1.074 —13.5 DYED; OLS 
(.013) 

1.029 —34.5 NCPI OLS 
(.011) 

1.069 —14.5 NCPI VCt 
(.013) 





Note.—The R”’s in each of these models were high, those with dummy variables were about .99, and those 
without the dummies were around .95. There were 1,200 observations (25 years for each of 48 states). SEs in 
parentheses. 

* DY, isa dummy variable for each year with 1978 excluded, DS; isa dummy variable for each state with Wyoming 
excluded, and NCPI is the national consumer price index for each year. 

7 In this equation a variance-component (VC) error scheme with an individual random effect for each state and 
each year was used. See Fuller and Battese (1974) for a discussion. 


the range reported in table 1 for the pooled model and were not 
significantly different from one another. No trend in these estimates 
was noted. Time-series changes in the industry do not seem to have 
caused biases in the estimates in table 1.9 

‘Two further comments should be made in relation to these results. 
First, these estimates are not consistent with an effective cartel in the 
cigarette industry. The firm-level elasticity estimates are well below 
the range of the industry demand elasticity estimates usually found 
for cigarettes. (These are generally between —0.3 and —0.8.)!° Second, 
the effect of monopoly power as a source of price variation in the 
industry is small compared with tax rate differences and other cost 
differences over time and space. 


* In fitting the nonlinear specifications presented in n. 7 above, the hypothesis that A 
= 1] is not rejected. For (4’) \ = 0.972 with SE = 0.027, and for (4”) A = 0.981 with SE = 
(0.026. These tests support the appropriateness of the linear model results presented in 
table 1. The estimate of y from (4’) is 1.029, and from (4") it is 1.062. These, together 
with the \ estimates above, imply values of 6 in the range of those in table | or slightly 
lower. This supports the hypothesis that while perfectly competitive pricing behavior is 
rejected, monopoly power is not a major factor in determining cigarette price differen- 
tials. The hypothesis that interstate smuggling would imply that price is a concave 
function of tax rate, as discussed in n. 6 above, is not supported by these results. As a 
further nonlinear specification, a squared tax rate term was added to the separate 
yearly price equations. In most years the (R,)? variable had an insignificant effect on 
price. In only a few years out of 25 were the coefficients for Ry positive and (Rj)? 
negative. 

10 In related work (Sumner 1979), using cross-section time-series data for the last 10 
years, which attempts to indirectly quantify the extent of interstate cigarette bootleg- 
ging (“buttlegging”), I find the national industry-level elasticity of demand for cigarettes 
to be approximately —0.40. This is in the range of other estimates from time-series 


data. 
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Barzel (1976) has suggested an alternative interpretation for the 
coefficient on R;, in the pricing equation. He argues that a higher per 
unit tax on a good will induce a higher quality per unit to be sold. 
Since the relative price of quality is lower in high-tax areas, substitu- 
tion of quality for quantity will occur. The after-tax price per unit will 
be higher in a higher tax rate area to reflect the higher quality of the 
product. He also notes that ad valorem taxes fixed as a percentage of 
retail price (as was the practice in New Hampshire prior to 1975) 
would not have this effect. 

As Barzel points out, products actually traded are combinations of 
many characteristics, most of which are extremely difficult tor either 
the government or the empirical economist to measure. Thus, when 
Barzel argues that the quality of cigarettes varies positively with the 
tax, no direct measurement of quality changes could reject that hy- 
pothesis. Barzel turns instead to price data on cigarettes across years 
and states as published by the Tobacco Tax Council. The nature of 
these data is crucial. As noted above, the price data available for 
cigarettes have the most obvious quality variations (type of sale and 
type of cigarette) held fixed across states. Furthermore, given cen- 
tralized manufacturing and national distribution of brands, there is 
little scope for direct product-quality variation. This leads one to 
question the strength of the expected Barzel effect in these data even 
if it were present in the real world. Hence it would appear that the 
“elasticity of demand” interpretation of 6 is a serious contender with 
Barzel’s quality shifter for explaining a 6 coefficient greater than 1.0. 
Finally, note that the Barzel effect would cause an upward bias in the 
measurement of monopoly power so that there is an even stronger 
case for concluding that the cigarette industry does not act as a 
cartel." 


1 As further consideration of the strength of the Barzel effect in cigarettes, some 
estimation was done which included data from New Hampshire with the other states. 
These results were essentially identical to those reported in the table. New Hampshire, 
which had an ad valorem cigarette tax until 1975, was found to have a strongly negative 
state dummy variable. However, it may be the case that New Hampshire has a low price 
of cigarettes net of taxes not because its tax was ad valorem as suggested by Barzel but 
for the same reason that the per capita sales of cigarettes in New Hampshire are more 
than double the average of the other New England states. That is, residents of nearby 
states buy a significant portion of their cigarettes in New Hampshire. Given size 
economies for sales transactions, a high proportion of this interstate trade would tend 
to reduce the average price net of taxes in New Hampshire. Barzel’s ad valorem tax 
hypothesis implies that the special position of New Hampshire would have evaporated 
after 1975 (when a fixed per unit tax replaced the ad valorem tax). The transaction cost 
hypothesis (among others) predicts that there will be no effect of the 1975 law change. I 
found that the New Hampshire residuals from the linear model estimated for each year 
are just as negative after 1975 as before 1975, which indicates that the ad valorem tax 
was not the different New Hampshire characteristic. This is not to suggest that the 
Barzel effect is not theoretically sound but, rather, that interstate cigarette prices may 
not be useful to test the theory. 
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The statistical results presented in table 1 support rejection of the 
hypothesis of atomistic competition in the cigarette industry. The 
demand curves facing cigarette firms are not flat but are close to it by 
most standards. Taking @ = 1.05, a 1 percent rise in the price of a 
firm’s product would cause a 20 percent fall in the quantity sold. If the 
cigarette prices are above marginal cost by 5 percent, monopoly 
power adds approximately 3 cents to the average price of a package of 
cigarettes. 

This paper has proposed and demonstrated the applicability of a 
scheme for measuring monopoly behavior that may be relevant to 
several different industries. For example, application to the U.S. 
gasoline market is currently being pursued. Data requirements are 
information on product price and some factor that has a directly 
measurable per unit cost. Variation over time or space or some other 
dimension is required for statistical implementation. 
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Swedish Tax Rates, Labor Supply, 
and Tax Revenues 





Charles E. Stuart 


University of California, Santa Barbara and Unwersity of Lund 


Effective marginal tax rates on labor income for the “representative” 
Swede have increased from roughly 50 percent in 1959 to 80 percent 
today. The effects of this increase in the level of taxation are exam- 
ined using a two-sector model parameterized to correspond to the 
Swedish economy. The model contains a single household which 
allocates labor to either taxed (essentially market) or untaxed (largely 
household) uses. The estimated long-run effects are sufficient to 
explain up to 75 percent of the recent decline in the measured 
growth rate of the Swedish GNP. Calculations of total tax revenues 
are also derived from the model. These peak when the tax rate is 
approximately 70 percent, indicating that Sweden is presently on the 
downward-sloping portion of its “Laffer Curve.” 


This paper considers the statement that high taxes can have serious 
consequences. Given the importance of the issue and the attention 
paid it in informal discussions, it is surprising that little formal analy- 
sis has been directed at it. The present treatment focuses on the 
Swedish experience. This is a case of current events steering an 
economist’s eye. The growth of the Swedish GNP has slowed consid- 
erably in recent years, falling from 4.4 percent per year during 
1962-67, to 2.9 percent during 1967-72, to 1.4 percent during 
1972-77. At the same time, tax revenues as a portion of GNP have 
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increased from 0.388 in 1962 to 0.627 in 1977.1 While the exercise of 
presenting these numbers proves nothing, it does indicate that a study 
of the allocational effects of high taxes may provide insight into the 
faltering of the Swedish economy. 

There are many possible explanations of the Swedish economic 
stagnation, and the goal here is not to test competing explanations 
against each other. Instead, the strategy is to concentrate on house- 
hold willingness to supply labor to a taxed (market) sector and to ask 
whether changes in the tax rate might be expected to have significant 
long-run effects on the equilibrium quantity of labor supplied. The 
first major conclusion of this paper is that the calculated “supply-side” 
effects of high taxes are of sufficient magnitude to at least partially 
explain the observed stagnation. 

A useful summary statistic for the effect of the level of taxation on 
labor supply is the (marginal) tax rate which maximizes total tax 
revenues (or, in more popular terms, the rate which yields a 
maximum of the “Laffer Curve” for labor). The usefulness of this 
rate—hereafter denoted t*—is that as taxes increase and ¢* is ap- 
proached, the marginal social cost of additional tax revenues becomes 
infinite. While the socially optimal tax rate cannot be determined 
without regard to a welfare criterion, rather extreme assumptions 
must be made about the nature of social welfare in order to imply an 
optimal tax rate greater than ¢*, since reductions in the tax rate from 
levels exceeding ¢* would increase both the worth of output and tax 
revenues.2 Values of t* are calculated here; depending on the as- 
sumptions used, the best estimates of ¢* lie in the interval 69-73 
percent. This leads to the second major conclusion of this paper, that 
actual tax rates in Sweden are in excess of t*. The aggregate marginal 
tax rate (defined below) in Sweden reached the 70 percent level in the 
early 1970s and is slightly in excess of 80 percent today. 

The vehicle of analysis is a two-sector (Harberger) model where an 
aggregate household allocates labor to either a taxed or an untaxed 
sector.? The former corresponds closely to the legal, market portion 
of the economy and is typically represented in national income ac- 
counts. The latter includes home production and activities yielding 
undeclared income (illicit tax evasion); leisure may also be included. 
The advantage of such a general equilibrium framework is that it 


1 These include direct and indirect taxes, social insurance premiums, and estate and 
gift taxes. 

2 See Cooter and Helpman (1974) for a discussion of optimal taxation under differ- 
ent welfare functions. 

3 A similar sectorial division was used by Boskin (1975). As in Harberger’s (1962) 
treatment, Boskin’s analysis is not based on an explicit utility function; demand elas- 
ticities are instead taken as parameters. The present model does explicitly specify a 
utility function; in this respect, it is similar to the model of Shoven and Whalley (1972). 
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yields a clear picture of how taxation affects the relative benefits of 
different uses of labor and how this in turn impinges on labor supply 
and output. An alternative and simpler approach would be to inter- 
pret econometric estimates of labor-supply elasticities as reflecting the 
net-wage sensitivity of labor supplied to taxed uses and to assume that 
any tax increase leads to an exactly corresponding reduction in the 
net wage of taxed labor. However, this alternative approach suffers 
from several weaknesses: (1) it is partial; (2) it would in practice be 
based on constant elasticity estimates, whereas (as will be seen below) 
labor-supply elasticities should vary with the marginal tax rate; (3) it 
does not yield estimates of output under different tax rates and thus 
makes simple welfare comparisons difficult; and (4) indeed, the set of 
activities measured in traditional labor-supply studies does not coin- 
cide exactly with the set of taxed uses of labor. Despite this care in the 
choice of a general equilibrium framework, the present study should 
still be regarded as more of a first analysis than a final answer to the 
problem of the supply-side effects of taxation. Perhaps the two most 
severe limitations of the analysis are the highly aggregated treatment 
of untaxed uses of labor and the relative lack of attention given capital 
accumulation. 

In the next section the model is formulated and parameterized. 
Results are presented in Section II. Section III is concerned with the 
predictive power of the results. Concluding remarks are made in 
Section IV. 


I. The Model 
A. Structure 


Consider a highly aggregated economy with two sectors, a taxed 
sector, in which employed labor is subject to a tax, and an untaxed 
sector, where supplied labor is not taxed. One may think of the 
untaxed sector as involving not only illicit tax evasion via undeclared 
exchange but also fully legal forms of tax evasion, such as painting 
one’s own home instead of hiring a professional painter, cooking 
fancy meals instead of eating out at a fine restaurant, repairing an 
automobile oneself, sewing clothes, watching children, cleaning the 
house instead of hiring domestic help, etc. It is also possible to inter- 
pret leisure as an output of the untaxed sector. The untaxed sector 
thus consists largely of home production activities. On the other hand, 
the taxed sector is that part of the economy which is traditionally 
treated in national income accounts. 


* A more detailed model of home production, leisure, and market work is contained 
in Gronau (1977). 
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Each of these sectors is represented by a Cobb-Douglas production 
function, a choice motivated by the observed statistical stability of 
these functions (see Douglas 1976). Letting Y, and Y, denote the 
outputs of the taxed and untaxed sectors, respectively, and letting L, 
and L, signify the corresponding uses of labor in these sectors, the 
production functions are: 


Yu ALt, (la) 
Y, = BL, (1b) 


where 4, a, B, and b are parameters. Formally, capital is subsumed in 
the coefficients 4 and B, while a and b represent labor’s shares in the 
two sectors. This formulation implicitly assumes perfectly inelastic 
supplies of capital to both sectors so that capital remains constant 
during the simulations. Aside from simplifying the analysis, such a 
treatment avoids the difficult empirical problem of estimating the 
capital stock in the untaxed sector. In addition, relaxing the constant 
capital assumption—by letting the supply of capital to either sector 
depend positively on sectorial net return—would cause the calculated 
value of ¢* to fall and hence strengthen the conclusion that the 
aggregate marginal tax rate in Sweden today exceeds t*. This is 
because an increase in the tax rate on labor income would then give 
rise to the following additional negative feedback on the tax base: As 
the amount of taxed-sector labor fell, capital’s marginal product in 
taxed production would also fall, decreasing the demand and hence 
equilibrium quantity of taxed-sector capital, thereby reducing the 
gross wage obtained in the taxed sector. The result is that the taxed 
sector’s wage bill would be lower (than it would have been with fixed 
capital).® 

Household choice dictates the allocation of labor to the taxed and 
untaxed sectors; the household supply constraint for labor is 


Lt Le KL. (2) 


For simplicity, all taxes are treated as impinging on households. 
The problem of the household—and again the analysis is simplified 
by considering the behavior of a single, aggregate household—is to 


° An additional source of erosion in the tax base for labor income can occur if capital 
is variable and if the tax under consideration falls on capital as well as labor income. 
This occurs with income taxes. Here, an increase in the tax rate will cause an increase 
(or no decrease) in the purchase price of capital, and less capital will be used in the 
taxed sector. This in turn reduces the taxed sector’s wage bill. The net result is that tax 
revenues must peak at a lower tax rate when capital is taxed than when it is untaxed. 
Again, the assumption of fixed capital leads to the “correct” bias; relaxing it while 
simultaneously allowing taxation of capital can only strengthen the conclusion that the 
aggregate marginal tax rate in Sweden today exceeds ¢*. 
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maximize a utility function dependent on Y, and Y,. Two alternative 
forms for utility are proposed: 


Una Yt Ys. (3a) 
U = Y3yi-«, (3b) 


The former encompasses the hypothesis that the household is an in- 
come maximizer and that Y, and Y, are perfect substitutes. The latter 
formulation allows imperfect substitutability between Y, and Yp. 
From the point of view of the household, tax revenues are redis- 
tributed on a lump-sum basis, and capital incomes are obtained on the 
same basis. The amount of Y, consumed by the household is thus 


Y,=(1 bul, -REL (4) 


where t is the average tax rate, w is the wage, R is redistributions, I is 
capital income, and where the output of the taxed sector is taken as 
numeraire. It is assumed that the government budget balances in 
equilibrium, R = twL,, and that capital’s share of Y, is 1 — a, sol = 
(1 — a)Y,. These conditions guarantee that the amounts of Y, produced 
and consumed are equal. The amount of Y, consumed is taken as the 
amount produced, as no taxes are levied on Y, or Ly. 


The first-order condition for a maximization of utility subject to 
(1b), (2), and (4) is 


(Lew = (BE / “Fy )OBLE. (5) 


where the left-hand side is marginal retentions of Y, from a unit of 
labor allocated to taxed production, given marginal tax rate t’, and 
where the two terms in parentheses on the right-hand side are the 
marginal rate of substitution and the marginal product of labor in 
untaxed production, respectively. The marginal rate of substitution is 
either one, for (3a), or [((1 — a)/a] - (Y,/Y,), for (3b). The import of (5) 
is that taxes affect the household by altering the relative marginal 
benefits of labor allocated to the two sectors. 

The model is closed by assuming that the wage in the taxed sector 
is determined under competitive conditions: 


= aALt!, (6) 


B. Parameterization 


To use the framework presented above as a basis for simulations, it is 
necessary to obtain parameter estimates of 4, a, B, b, and a, as well as 
to estimate the amount of labor available, L. A summary of the 
method of parameterization is presented here; a more detailed statis- 
tical discussion is contained in the Appendix. 
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The technique for parameterizing the model is to pick an appro- 
priate base year, choose units of measurement for labor so that the 
wage equals | during that year, and then to find parameters consis- 
tent with actual economic behavior in Sweden for that year. The 
choice of the base year for the study was 1969 as it was a year of 
“normal” business activity; that is, it was neither a peak nor a trough 
of a business cycle (1968 was a low, while a peak occurred in 
1970-71). 

First, a is taken as labor’s share of measured GNP (see Appendix, 
item 1). This yields a = 0.752, whence a knowledge of Y,, L,, and a 
allows A = 4.19 to be calculated in accordance with (1a). 

Estimating the shape of the untaxed sector’s production function is 
more difficult. Due to the long-term statistical stability of labor’s share 
in Cobb-Douglas functions and the consideration that many of the 
commodities produced by one sector can also be produced by the 
other, the naive assumption that b = a = 0.752 is used for most 
simulations. However, given the desirability of conducting sensitivity 
analysis, as well as the fact that 0.752 is somewhat greater than the 
values of labor’s share found by many other authors (for discussion, 
see Douglas [1976]), the alternative assumption that b = 0.5 is applied 
to two sets of simulations. 

Estimates of the amount of time spent in untaxed production and 
the value of labor in untaxed production are derived from time- 
budget and wage data, with the latter calculation based on a variant of 
the method used by Nordhaus and Tobin (1972) in their work on 
“measures of economic welfare” (“MEW”s) (see Appendix, item 2). 
In most of the simulations, L, and Y, are defined as not including 
leisure; this implies L, = 102.63 and a value of untaxed labor of 
54.45.§ In the remaining cases leisure is included, yielding L, = 
205.26 and a value of untaxed labor of 105.14 for 1969. Aside from 
the obvious benefit of including leisure as an alternative use of 
household time, the exercise of running simulations on L, defined to 
include leisure has the advantage of providing a sensitivity test on the 
otherwise fairly coarse estimates of untaxed production. Such a sen- 
sitivity test is particularly valuable, since estimated untaxed produc- 


° Hawrylyshyn (1976) surveys earlier estimates of the value of home production and 
discusses the appropriateness of the methods used. He also recalculates earlier esti- 
mates to put them on a comparable and more correct basis. He concludes that the value 
of labor in home production is 35-40 percent of GNP (see also Murphy [1978] and 
Adler and Hawrylyshyn [1978] for more recent estimates for the United States and 
Canada). The ratio of value of labor in home production to GNP in the present study is 
40 percent. One might have expected a higher value for Sweden in 1969, since greater 
taxes should imply relatively more time devoted to untaxed (home) production. Note, 
however, that these earlier studies use a 20 percent tax rate in deriving after-tax wages; 
the rates used in the present study are 30-40 percent. 
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tion is calculated as home production and thus fails to capture pro- 
duction which is untaxed due to illegal tax evasion." 

A knowledge of the amount of time spent in untaxed production 
and of the value of labor in untaxed production is then used to find a 
value of B for each case. The method is to solve 


hex (value of labor in untaxed production) (7) 
a phel NY Ee ea ae 


for Y, and then to substitute into (1b). 

Throughout most of the simulations, the total amount of labor 
available to the household is assumed to be perfectly inelastic. The 
value of L is thus the sum of L, and L, for 1969. This procedure might 
be objected to since greater after-tax wages may create an inducement 
for households to supply more labor. To evaluate the sensitivity of the 
results to this sort of effect (which might also arise if net immigration 
depends on after-tax wages) as well as to treat an alternative, “indi- 
rect” method of introducing leisure, several simulations are con- 
ducted using the specification 


Eclat le. (8) 


where c is chosen so that L = 205.26 when the 1969 values of w and ¢’ 
are inserted. This formulation assumes a constant, compensated elas- 
ticity of supply of labor of 0.1.8 

Finally, data on taxes are needed to complete the parameterization. 
To get a feel for how marginal taxes should be interpreted in the 
model, it is useful to think for the moment of the taxed and untaxed 
sectors as being market and nonmarket sectors, respectively, and to 
ask, What taxes will an individual face if he operates in the market 
sector? First, the individual sells his labor (L,). The employer pays a 
wage w. Before the employee receives his paycheck, payroll taxes are 
drawn. Next, income taxes are paid and the individual is left with a 
sum of money. This money is taken to a commodity market to be 
reconverted into goods and services. Indirect taxes are then drawn. 
Thus the tax rate relevant to the model includes payroll, income, and 


7 Unfortunately, no careful economy-wide estimates of the extent of illegal tax 
evasion exist for Sweden. For the United States, recent work by Feige (1979, 1980) 
suggests that the size of the “monetary unobserved sector” (i.e., the volume of unre- 
ported transactions which use money as a medium of exchange) has increased greatly in 
recent years. Feige calculates that the size of this sector was roughly 5 percent of 
measured GNP in the early 1960s but amounts to 25-30 percent of measured GNP 
today, with most of the growth taking place in the 1970s. (In correspondence, Feige 
expresses greater confidence in his estimates of growth rates for this sector than in the 
estimated absolute sizes of the sector.) 

5 Actually the elasticity is almost compensated since (1) all tax revenues are returned 
to the household on a lump-sum basis in the model, but (2) marginal increases in taxes 
may decrease national income slightly. 
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TABLE 1 
SWEDISH Tax Rates 1954-74 











Year Average Tax (t) Marginal Tax (¢’) 
1954 a Tal 419 
1959 rol 491 
1964 393 583 
1969 .468 .655 
1974 557 .764 





indirect taxes. In addition, certain government transfers are in- 
come-indexed, so a correction must be made for the marginal loss 
of subsidies due to increased labor (L,) income. The starting point for 
a calculation of these “total” tax rates is a study by Jakobsson and 
Normann (1974), where the aggregate average and marginal income 
tax rates for the Swedish economy were estimated.® Table 1 contains 
calculated values of the taxes used in the model (for details, see 
Appendix, item 3). 

On the basis of these rates, it is assumed that increases in tax rates 
satisfy 


t = 0.829t' — 0.075. (9) 


The slope of this function is calculated as the change in marginal tax 
rates divided by the change in average tax rates from 1954 to 1974. 
The constant (—0.075) adjusts so that (9) holds exactly for 1969. 

Four scenarios (sets of simulations) are considered under the 
income-maximization hypothesis. Scenario A is the basic simulation; it 
has b = 0.752, leisure not included in L,, and constant L. In scenario 
B, the value of b is reduced from 0.752 to 0.5 as a sensitivity test. 
Scenarios A and C are identical except that leisure is included in L, in 
the latter. Finally, scenario D differs from scenario A only in that L is 
assumed to vary according to (8). 

It is interesting that if the parameter values for scenarios A—D and 
1969 values of L, and L, are inserted into the first-order condition for 
a maximization of income (eq. [5] with a unitary MRS), it is found that 
the implied values of the marginal tax rate are significantly less than 
the rate which actually prevailed in 1969. These implied rates are 
between 0.47 and 0.49, which approximate the marginal tax rates in 
1959-60. The interpretation is that if the income-maximization hy- 


9 This was done by formulating an aggregate total tax function based on tax law and 
population (mainly income-distribution) data, simulating different total income-total 
tax values, and regressing total taxes on total income. Rates for 1974 were supplied by 
G. Normann. 
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TABLES2 


SCENARIOS AND PARAMETER VALUES 








Scenario b B a iE Explanation 





Income Maximization 











A. D2 nee 2 ae 205.26 Basic 

B. 5 LOS See 205.26 Sensitivity analysis, b 

C. B/D 255 Dae 307.89 Leisure included in L, 

D. LOD me Dee oes 22 (mG) le L depends on after-tax wage 
Cobb-Douglas Utility Maximization 

Ae 752 2.22 .744 205.26 Basic 

Bee a 10.75 .659 205.26 Sensitivity analysis, b 

Cc’. (152225 Dabo? 307.89 Leisure included in L, 

D’. .752 2.22 .744 228[w(1 —t’)]®! L depends on after-tax wage 

a SUS PP ANS 205.26 5-year adjustment lag 








Nore.—It is assumed that a = 0.752 and A = 4.19 throughout. 


pothesis is correct, there was a 9—10-year time lag in the adjustment of 
household behavior to tax rates. Such a long time lag may seem 
unreasonably high but is not impossible. Attitudes toward labor mar- 
ket participation and the “correct” amount of work are likely to 
change slowly, especially given the rigidity of social institutions such as 
the 8-hour day, the 40-hour week, etc. 

When the household is assumed to maximize the Cobb-Douglas 
utility function (3b), a parameter value for a must be specified. This 
value was chosen so as to be consistent with (or the root of) the 
first-order condition (5) for 1969 when all other parameters, en- 
dogenous variables, and the tax rate were inserted. Five scenarios are 
investigated for this utility function. The first four are similar to the 
four scenarios considered for income maximization. In the fifth case, 
itis assumed that a 5-year lag existed in the adjustment of L, and L, to 
changes in the tax rate. This is accomplished by letting the tax rate at 
which the model is parameterized be the marginal rate which oc- 
curred in 1964 (which, it may be noted, occupied a position in its 
business cycle similar to that of 1969). 

A summary of the scenarios is given in table 2. 


II. Results 


Within the framework presented above, simulations are conducted by 
plugging various marginal tax rates into condition (5) and then solv- 
ing numerically for the values of L, and L, and, hence, also of Y, and 
Y,, which solve (2), (5), (6), and, when applicable, (8). Equation (9) is 
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TABLE 3 
RESULTS 
Scenario and Marginal Tax (%) %AL, %AY , aay ie t* (%) 
A: 
80 225 —15.6 10.2 
65 —10.0 ae /e4l 1.86 46 
50 — 4.63 — 3.44 63 
B: 
80 —19.0 —14.6 7.92 
65 = 141 — 5.64 1.81 51 
50 Sali? = BRD .84 
G: 
80 —19.4 = 15:3. iA 
65 —10.57 ae aol 3.26 43 
50 — 5.69 — 4.26 1.10 
DD: 
80 —19.8 57) 10.2 
65 — 9.78 = 7A5 1.86 46 
50 = 4.73 = 3.51 .63 
Ais 
80 — 3.24 See .80 
65 — 1.46 — 1.10 52 73 
50 = she) =O) 40 
BY: 
80 ea. 3 — 2:43 1:13 
65 alee ele) .79 73 
50 = .83 = 1158 .64 
Ch: 
80 = 3:98 — 3.00 1.54 
65 = 195 a 1339 1.01 69 
50 eae oe. — .87 .78 
192 
80 = 3:68 a 2 .80 
65 — 1.75 = 1225 52 71 
50 = 96 = =76 40 
E’: 
80 ee i — 2,68 .92 
65 re det al aol .60 ie 
50 ee) 69 46 





Norte.—Percentage changes are calculated using differences in natural logarithms for tax rate increases of 1 
percent (e.g., from 80 percent to 81 percent). 


then used to determine total tax revenues (1.e., to derive the “Laffer 


Curve”). 
The results of the simulations are summarized in table 3. Since the 


model is parameterized using data from 1969, when the marginal tax 
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rate was 65.5 percent, simulations at tax rates significantly greater or 
less than 65.5 percent may be less reliable than simulations at rates 
close to 65.5 percent. For this reason, results are reported for the 65.5 
percent marginal tax rate, as well as for +15 percent changes in 
marginal rates. Reported in the columns marked “%AL,” and 
“% AY,” are the percentage changes in L, and Y, induced by a 1 
percent increase in marginal taxes (i.e., from 80 percent to 81 per- 
cent, 65 percent to 66 percent, and 50 percent to 51 percent). The 
column Y,/Y, gives the ratio of outputs of the two sectors. Finally, the 
tax rate which maximizes total tax revenues, ¢*, is presented. 

Several facets of the results stand out. First, the scenarios based on 
income maximization yield quite striking results. As tax rates increase 
from the 50 percent to 80 percent brackets, the ratio of untaxed to 
taxed output rises dramatically, as do the sensitivities of taxed-sector 
employment and output to small tax increases. The scenarios based 
on Cobb-Douglas utility maximization give somewhat less striking 
results, but the general pattern is the same. For instance, a shift from 
50 percent to 80 percent taxes generally implies a doubling in the size 
of untaxed production relative to taxed production. 

An important aspect of the results is that, as tax rates rise in 1 
percent increments from 50 percent to 80 percent, the effects of 
successive increments on taxed-sector employment and output be- 
come increasingly larger. Part of the explanation is that the optimiz- 
ing household is interested in the rate of marginal retentions (1 minus 
the marginal tax rate), and marginal retentions fall by increasing 
percentages as marginal taxes rise incrementally. For example, if 
equations (5) and (6) are consulted and the marginal rate of substitu- 
tion is assumed constant (the income-maximization hypothesis), then 
a 1 percent tax increase around the 50 percent bracket requires a 
change in L, and L, such that the ratio of labor’s marginal products in 
the taxed and untaxed sectors rises by 2 percent, but at an 80 percent 
tax rate, the same 1 percent tax rise requires a 5 percent increase in 
the ratio of labor’s marginal products. 

The calculated values of ¢* are on the order of 50 percent under 
income maximization and 70 percent under Cobb-Douglas utility 
maximization. The sensitivity of ¢* around the 50 percent and 70 
percent figures is small, indicating a fair degree of robustness to the 
details of the parameterization of the model. The robustness of the 
calculated values of ¢* is enhanced by the fact that these values lie 
relatively near the tax rate which prevailed in 1969, the year used for 
parameterization. Further, the inclusion of leisure into L, leads to 
relatively small declines in t* of 3-4 percent from the basic scenarios 
(A and A’). 

The values of t* calculated here are for taxes on labor income alone. 
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However, there are two reasons why increases in the aggregate tax 
rate used in the present analysis might also affect the extent of 
taxation of capital: (1) except for the payroll tax, all of the compo- 
nents of this tax rate also apply, at least partially, to income from 
capital; and (2) in attempting to finance a greater volume of public 
spending, it is likely that the government would utilize (or has 
utilized) a mix of tax increases for both labor and capital incomes 
instead of relying solely on increased taxation of labor. One might 
therefore ask, How would the calculated values of t* change if t* were 
redefined to imply a maximum of tax revenues from both labor and 
capital incomes? 

To investigate this issue, it is useful to retain for the moment the 
assumption that capital is fixed. In this context, a simultaneous in- 
crease in the tax rates on labor and capital will have a positive initial 
impact on revenues from capital taxation but a negative feedback 
effect since national income, and hence the amount of capital income 
subject to taxation, will decline. The following calculations indicate 
that the net effect on the previously calculated values of ¢* is likely 
to be negligible. In 1969, returns to capital were 33.8 (billions of 
kronor). Total taxes collected in the economy were 62.8, of which 48.0 
were levied on labor income. Of the remainder, something like 14.0 
can be regarded as taxes on capital, whence the average tax rate on 
capital was 0.41 (= 14/33.8). From 1954 to 1974, the average tax rate 
on labor income rose by 0.286 (see table 1). This rise was partially due 
to a hike in the average income tax rate of 0.15 and to a rise of 0.02 in 
the indirect tax component of the rate on labor income. One possible 
set of assumptions, then, is that the average tax rate on capital rose by 
0.17 and that each 1 percent increase in the average tax on labor thus 
“caused” a 0.6 percent (= 0.17/0.286) increase in the average tax on 
capital, so that the tax on capital satisfied t, = 0.6t, + 0.13 (with 0.13 
chosen so the 1969 values, t, = 0.41 and ¢, = 0.47, hold simultane- 
ously). Then, with capital in perfectly inelastic supply and capital’s 
share of Y, equal to 0.248 (= 1 — 0.752): 


Total tax revenues = taxes on labor + (0.6¢, + .13)(0.248Y,). 


Values for taxes on labor, t,, and Y, can be obtained from the simula- 
tions. It is then a simple matter to calculate the modified tax rates 
which maximize total tax revenues, including taxes on capital. For 
scenarios A-D and A’-E’, these are, respectively, 0.45, 0.50, 0.41, 
0.44, 0.72, 0.72, 0.68, 0.70, and 0.71. Correcting for taxes on capital 
in this way thus reduces the table 3 estimates of ‘* by about 0.01. 
Of course, it might be argued that the relationship between tax 
rates assumed in this formulation (t, = 0.6t, + 0.13) understates the 
tendency for taxes on capital to rise where the rate on labor income 
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goes up, and that this yields an overly pessimistic view of the ability of 
the government to increase tax revenues. Perhaps a more plausible 
formulation would be that ¢, and ¢, lie in constant proportion to each 
other, t, = Bt, (and, say, 8 = 0.47/0.41). In this case, observe that: 


] = 
‘Total tax revenues = t,aY; + i.{(1 —a)Y, = WY (a Be Somer 


and a maximum of ¢,Y, must imply simultaneous maxima for both 
taxes on labor (t,aY ,) and for total tax revenues. 

Allowing in this way for capital taxation but keeping capital fixed 
thus has little or no effect on the calculated values of t*. What then is 
the effect of relaxing the assumption that capital is fixed? There are 
two reasons why this must cause the calculated values of t* to fall. 
First, if the total supply of capital to the economy depends positively 
on capital’s net return—as the work of Boskin (1978) suggests—then 
increased taxes on capital will reduce the amount of capital. in the 
economy, causing a decline in income and resultant declines in tax 
revenues from both capital and labor incomes. Second, to the extent 
that the taxed sectors for labor and capital coincide, a rise in the rate 
on capital will drive capital out of the taxed sector, and again taxed 


sector income, and hence tax revenues from both capital and labor, 
will fall. 


III. Does the Model Predict Well? 


It turns out that the simulations based on Cobb-Douglas utility 
maximization predict fairly well the actual behavior of output and 
employment during the period of rising taxes in Sweden. The income 
maximization hypothesis, on the other hand, leads to predicted de- 
clines in output and employment which greatly exceed actual de- 
clines. This allows a tentative rejection of the income maximization 
hypothesis. In what follows, rough calculations of the effects of recent 
increases in marginal taxes in Sweden are made under the assumption 
of Cobb-Douglas utility maximization. These predictions are then 
compared with the actual development of output and employment. 
First, it was noted at the outset that yearly growth rates declined 
from 4.4 percent in the early and mid-1960s to 1.4 percent in the mid- 
and late 1970s. This amounts to a decline in the growth rate of 3 
percent per year. Over this period, marginal tax rates rose an average 
of something like 1.5 percent per year, increasing from around 55 
percent to upward of 80 percent. In 1972, the marginal tax rate was 
70-71 percent. A calculation of %AY, at a 70 percent rate yields 
values of between 1.4 percent and 1.7 percent, depending on the 
scenario chosen. Applying a figure of 1.5 percent to the entire period 
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1972-77 (a procedure which effectively assumes a slight time lag in 
the adjustment of L, to changes in tax rates) implies a 2.25 percent 
output decline per year (i.e., the 1.5 percent yearly increase in tax 
rates X the 1.5 percent decline in output per percent increase in 
taxes). This calculated yearly output decline due to rising taxes 
amounts to 75 percent of the 3 percent fall in the growth rate.’ 

The model also predicts a decline in employment. During the 
period 1964-69, marginal taxes rose from 58.3 percent to 65.5 per- 
cent, which implies average yearly reductions in L, of between 1.8 
percent and 2.5 percent, depending on the scenario, if there was no 
adjustment lag. From 1964 to 1969, the number of hours worked 
divided by the population of working age (16—74) actually fell at a 1.6 
percent yearly rate. This is a fairly good fit. Moreover, a legislated 
reduction in the workweek was effected in 1970-72, and this may be 
interpreted as partially representing an adjustment to the tax rates in 
effect in the late 1960s. 

During the period 1969-77, the ratio of hours worked divided by 
the population of working age fell roughly 7 percent. The increase in 
marginal taxes which occurred during this period should, according 
to the simulations, have led to reductions in employment significantly 
in excess of this figure. While there is no reason to believe that the 
economy was particularly lax in adjusting to its equilibrium level of 
employment in 1969 (indeed, there was net immigration throughout 
the late 1960s), there are two reasons for supposing such a lag 
existed in the 1970s. First, the very size of the decline in L, predicted 
from scenarios A’—E’ (on the order of 30 percent) makes it unlikely 
that full adjustment could occur in a time span of less than 10 years. 
Second, a host of governmental and government-sponsored job- 
support programs, as well as a strict job-tenure law, were instituted 
and expanded in the 1970s. Assume then that, as of 1977, the econ- 
omy was adjusted to a 74 percent tax rate (the rate as of about 1973). 
Depending on the scenario, this should have led to a 15—20 percent 
decline in L, from 1969 to 1977. Since measured unemployment 
increased only slightly, this leaves roughly a 10 percent discrepancy 
between measured and predicted employment in the taxed sector as 
of 1977. Given the aforementioned job-support programs and the 


10 A prediction of future growth rates in Sweden can be derived. Assume that the 
marginal tax rate in effect today, about 80 percent, remains constant and that the 
economy is presently adjusted to a tax rate of 75 percent. This assumes a 5-year 
adjustment lag. Comparing output levels at 75 percent and 80 percent marginal tax 
rates, something like an 11 percent output decline (+1 percent, depending on the 
scenario) is to be expected. Spreading this out over the next 5 years, Swedes can await 
an average growth rate of roughly 1.5 percent over the next 5 years (4.4, less the 
unexplained 0.75, less 11/5), with yearly growth rates in the neighborhood of 3.5—4.0 


percent thereafter. 
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job-tenure law, the most reasonable use of this 10 percent figure is to 
interpret it as an estimate of the increase in the level of “hidden 
unemployment” (including on-the-job leisure) in the taxed sector of 
the Swedish economy. Since officially measured unemployment is 
typically only 0.5-2 percent, a 10 percent hidden unemployment 
figure is not unreasonable. 


IV. Concluding Remarks 


The analysis here suggests that high taxes on labor income can have 
quite significant effects on the sale of labor in taxed markets when 
alternative untaxed uses of labor exist. In this way, taxes can and 
probably have affected the recent growth performance of the Swedish 
economy. Further, the treatment permits the conclusion that total tax 
revenues in Sweden would be maximized at a tax rate of roughly 70 
percent, which is 10 percent less than the rate prevailing today."! 
It is important that the rather severe effects of tax increases above 
some critical level (70 percent?) may be associated with adjustment 
lags, at least if the tax increases occur over a short period of time. 
Social institutions such as the 40-hour workweek and attitudes toward 
work and labor market participation act as a brake on the adjustment 
process. Similarly, attitudes and effort devoted toward tax evasion 
(both legal and illicit) change slowly. Measuring these is perhaps as 
much the task of the sociologist as the economist. I will refrain from 
making subjective judgments about whether I think attitudes have 
changed in the last 10 years in Sweden. There are a number of 
indications of pressure on the institutions, however: Legislated 
maternity leaves have been increased; the yearly vacation has been 
increased from 4 to 5 weeks; there are increasing demands for part- 
time jobs; and an official commission recently investigated the 
feasibility of a 30-hour workweek, citing that “the long-run goal [of 


"If households are assumed to maximize a constant elasticity of substitution (CES) 
utility function with an elasticity of substitution of 0.5, and the assumptions of scenario 
A’ are used to derive parameter estimates, the calculated value of t* is 82 percent. This 
is close to the present tax rate. Decreasing the elasticity of substitution from 1 (as in the 
Cobb-Douglas case) to 0.5 takes away about half of the explanatory power of rising 
taxes as a determinant of growth rates. It also implies a simulated yearly decline in L, 
from 1964 to 1969 of 1.1 percent (the actual rate was 1.6 percent), and implies a 9.6 
percent drop in L, from 1969 to 1977. 

” A survey conducted by the university graduates labor union in 1978 and published 
in the union paper (SACO-SR tidning, March 5, 1979) asked, “Do you wish to work part 
time?” Thirty-one percent answered yes. Since 1975, the percentage of men answering 
yes rose from 14 percent to 27 percent. A third of all M.D.’s answered yes. Sixty percent 
of the members surveyed preferred a shorter workweek to increased pay. Forty-four 
percent of the women wanted a 6-hour day, and 48 percent of the women with full-time 
jobs wanted to work part time. 
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the labor unions] is a 30-hour workweek” (Statens Offentliga Utred- 
ningar 1976, p. 9). 


Appendix 


A more detailed description of the parameterization of the model is pre- 
sented here. Unless stated otherwise, figures apply to 1969. 

1. GNP at factor prices was 136.4 billion kronor, whence Y, = 136.4. 
Wages were 92.07 (in billions of kronor). In addition, there was an operating 
surplus of 39.7, some of which is returns to labor. To estimate how much, 
note that wages paid by industries and households were 62.7. Note also that 
hours worked by entrepreneurs and employees were 761 and 4,518 million, 
respectively. The assumption that entrepreneurs have the same wage as 
employees leads to an estimate of the implicit wages of entrepreneurs of 62.7 
x (761/4,518) = 10.56. Thus L, = 92.07 + 10.56 = 102.63 and a = 102.63/ 
136.4 = 0.752. 

2. Unfortunately, the only household time-budget study available was for 
1976. Activities included in untaxed production are cooking, child care, 
clothing repair, care of pets, care of dwelling, care of cars, boats, and bikes, 
care of garden, and other (cleaning, washing, dishes, purchasing, etc.). Time 
spent in these activities averaged 672 minutes/day. Since the population 
included in the study consisted of families with children living in apartment 
buildings, and since the study was conducted during winter months when 
care of dwelling, garden, etc., are likely to be seasonally low, 20 minutes/day 
was added to time spent in untaxed activities to seasonally adjust the data. 
Work (and study) took 642 minutes/day. This implies that L, = (692/642)L, = 
1.08L, = 110.84. However, taxes were greater in 1976 than in 1969, so the 
estimate of L, for 1969 was that it just equalled L,, that is, L, = 102.63. 

This estimate of L, does not include leisure. To include leisure, note that 
647 minutes/day were spent by the average household in the study in recre- 
ation and relaxation in 1976. On this basis, L, was taken as exactly twice L, 
(i.e., Ly = 205.26) when leisure was included. 

The value of untaxed production was estimated by 


En, LArw? + hfw), 


where n; is the number of households of type j, h7j is the number of hours/ 
year spent by males in household j at activity 7, w7" is the after-tax male wage 
for activity 7, and h{,and ware similarly defined times and wages for females. 
Six household types were considered: with one adult and 0, 1, or > 2 
children, and with two (or more) adults and 0, 1, or = 2 children. The basic 
data on the n; were taken from the 1970 Population and Housing Census, part 9. 


13 In addition, all of the major political parties in Sweden except the conservatives 
(with roughly 20 percent of the vote) support a 6-hour working day; the conservatives 
state only that the matter should be decided in union-management negotiations. 
One might ask, How does a 30-hour workweek fit with the predictions of the model? A 
shift from 40 to 30 hours/week is a 29 percent reduction. The actual reduction may be 
somewhat less if such a change is legislated, however, since labor-market participation 
might rise (it has risen historically during such changes). Thus something like a 20 
percent decline in taxed-sector labor would occur. This is almost exactly the decline 
needed to restore equilibrium at the 80 percent tax rates prevailing today given the 
logic of scenarios A’—E' (cf. Sec. III). 
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An adjustment was made for the fact that roughly 30 percent of the persons 
reported to be single adults living with children are actually also living with 
another adult (for tax and subsidy reasons, it is advantageous to report in this 
way—the source for the 30 percent figure is the time-budget study, Kon- 
sumentverket 1977, p. 30). 

Wage data were taken from Wages, 1969. The relevant tax rate is the 
marginal income tax rate plus a correction for income-based subsidies. For 
1969, these were 0.478 and 0.052, respectively (for discussion, see Appendix, 
item 3 in this study). However, application of a 53 percent tax rate is likely to 
overestimate the taxes which would be paid on marginal income, due to the 
fact that many of the individuals devoting large amounts of time to home 
production have low incomes and hence lower marginal taxes than the aver- 
age income earner. A 40 percent tax rate was thus applied to all untaxed 
activities except care of pets, care of garden, and other—a 30 percent rate was 
used for these. 

The following assumptions were used to estimate hours spent in untaxed 
production. (1) The two-adult, childless household was considered to have the 
same uses of time in untaxed production as a two-adult, one-child (ages 4—6) 
household except that time spent in child care was set at zero. (2) Times in 
untaxed production for a one-adult, one-child household were 0.94 times the 
hours for a woman in a two-adult, one-child (ages 4-6) household. The figure 
0.94 derives from the consideration that a single parent is likely to spend 
more time in market work than is an average parent in a two-adult household. 
(3) The single-adult, no-child household was assumed to have the same uses 
of time as the average over males and females of time spent in a two-adult, 
single-child (ages 4—6) household, except that time spent on child care was set 
equal to zero. (4) The one-adult, = 2 child household was assumed to have 
the same uses of time as a female in a two-adult, single-child (ages 4-6) 
household. 

In evaluating the value of leisure time, the after-tax “wage” rate was taken 
as 5 kronor/hour. This figure was generally less than the after-tax wage rates 
for untaxed productive activities. 

3. For each year, the tax-rate base used for calculating tax rates was one 
plus the payroll tax rate (sources: Industriens utredningsinstitut 1976, pp. 
77-78; Svenska arbetsgivareforening 1978). The total tax rate consisted of 
the payroll tax plus the average or marginal rate on income and the average 
or marginal indirect tax rate, plus a correction for income-based subsidies, all 
divided by the rate base. Indirect tax rates were found by dividing total 
indirect taxes by disposable income and multiplying by 1 minus the average or 
marginal tax rates on income to find average or marginal indirect tax rates, 
respectively. It was then assumed that 75 percent of the calculated indirect 
rates were borne on the margin by labor income. Approximate correction 
rates for transfers were derived by dividing transfers to households by dis- 
posable income. Payroll taxes vary from a few percent in the 1950s to 22.4 
percent in 1974 (to 35.1 percent today). Indirect rates, after being multiplied 
by 0.75, are on the order of 5-10 percent. The transfer correction term rose 
from 3.5 percent in 1954 to 6.9 percent in 1974. 

A sample calculation—for the 1969 aggregate marginal tax rate of 65.5 
percent—is as follows. Jakobsson and Normann (1974) report an aggregate 
marginal income tax rate of 0.478. The payroll tax rate for 1969 was 0.125. 
Note that this rate is assessed per krona of gross wages (i.e., wages before 
income taxes) paid by employers. Indirect tax revenues divided by disposable 
income were 0.209 (= 18,409/87,979), implying a marginal, indirect-tax rate 
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on net-of-income-tax income of 0.157 (= 0.209[3/4]). In addition, transfer 
payments divided by disposable income were 0.052 (= 4,553/87,979). Thus, 
an employer who paid 1.125 kr for labor generated payroll taxes of 0.125 kr 
and gross income of 1.0 kr. Of the income, 0.478 kr went to income taxes 
while the remainder (1 — 0.478) was available for consumption and saving 
and is assumed to have been taxed at a 0.157 rate. The 0.052 transfer rate is 
treated as an additional marginal tax payment on gross wages. The aggregate 
tax rate is thus total taxes generated by the 1.125 kr increment in income, 
divided by 1.125: 


0.125 + 0.478 + 104i 0.157) + 0.052 _ 9 655. 
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A Superior Solution to Captain MacWhirr’s 
Problem: An Illustration of Information 
Problems and Entitlement Structures 


Gene E. Mumy 
Ohio State University 


You couldn't tell one man’s dollars from another’s, he 
said, and if you asked each man how much money he 
brought on board he was afraid they would lie, and he 
would find himself a long way short. [JosEPH CONRAD, 
Typhoon] 


In his novel Typhoon, Joseph Conrad posed a classic problem of 
information revelation. At first glance, the problem appears to be 
insoluble because of the apparent incentive for individuals to gener- 
ate wrong information. After further consideration, however, it turns 
out that an appropriate change in the entitlement structure of the 
problem allows it to be solved by generating an incentive for each 
individual to reveal correct information. In this sense, the problem is 
akin to the class of free-rider problems associated with public goods 
that can be solved by using the demand-revealing process (Tideman 
and Tullock 1976). 

Conrad’s problem is stated in Section I, along with the arbitrary 
solution imposed by the fictional character Captain MacWhirr. A 
superior solution, which actually generates correct information, 1s 
presented in Section II. Section III contains a brief discussion of the 
similarities between the superior solution and the demand-revealing 
process for public goods, followed by conclusions. 


The author would like to thank Richard Jensen and William Manson for their helpful 
comments. They are not, however, responsible for any errors in this paper. 
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I. The Problem and Captain MacWhirr’s Solution 


In Typhoon, 200 Chinese workers are being transported to their home 
province on a ship commanded by Captain MacWhirr, after working 
for 7 years in various tropical colonies. Each worker’s accumulated 
savings of silver dollars are stored in his own camphorwood chest. As 
the title of the story suggests, all is not smooth sailing during the trip 
and the ship runs into a typhoon. 

The problem that we are interested in arises when, during the 
course of the storm, all the wooden chests get smashed and the silver 
dollars are all scattered between decks. A riot ensues among the 
Chinese workers as they try to retrieve their money. In order to stop 
the conflict, the captain sends the first mate and some men to pick up 
all of the money, which they do. When the storm is over, the captain, 
being a fair man, wants to return the money to its rightful owners, but 
he has a problem. He does not know how much money each man had. 

The problem here is to obtain information about how much money 
each man actually had, assuming only that the captain knows the total 
amount of money and that each man knows the amount he had but 
not necessarily the amount that anybody else had or even the total 
amount of money. The captain realizes that he cannot just ask each 
man how much money he had and then give him that much because 
some of them might overstate the amount. In this event, the sum of 
the claims would exceed the total amount of money, thus making a 
distribution on this basis impossible. The captain concludes that it is 
impossible to guarantee correct information and therefore imposes 
an arbitrary distribution that he thinks is fair. According to him, the 
fair assumption is to assume that all the men had the same amount of 
money and, therefore, he gives an equal share of the total to each 
man. 

It is obvious that this distribution is not necessarily fair at all unless 
all the men really did have the same amount of money, which is not 
known to be the case. The captain’s solution to the problem is not a 
solution to the information problem but rather a surrender. Such a 
surrender, however, is not necessary, as will now be demonstrated. 


II. A Superior Solution 


If only the individual knows how much money he had, the captain is 
right to think that this information can only be obtained by getting the 
individual to reveal the correct amount. He is wrong, on the other 
hand, to think that there is necessarily an incentive for the individual 
to overstate the amount. The mistake here is the implicit assumption 
that the amount stated by an individual is also the amount to which he 
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is entitled. Under such an entitlement structure, each individual has a 
clear monetary incentive to overstate his share. The way to solve the 
problem, then, is to cut this direct link between amount revealed and 
entitlement in such a way as to obtain correct information while at the 
same time entitling individuals to the amounts they actually had. 

In Captain MacWhirr’s problem, there is a fairly straightforward 
entitlement structure which generates an incentive for individuals to 
reveal correct information. Instead of letting the amount claimed be 
the entitlement, let the maximum amount to which an individual is 
entitled be the difference between the total amount of money and the 
amount claimed by everyone else. If an individual claims less than the 
maximum amount, then he gets what he claims, but if he claims more 
he is penalized and actually receives less than the maximum amount 
to which he is entitled. As a result, no individual has an incentive to 
overstate his share (or to understate it) and he reveals the amount he 
actually had. This also means that each individual is thereby entitled 
to the amount he actually had. A more formal presentation of the 
problem will make this point clearer. 

For the following analysis, let 7, = the amount received by the ith 
individual; c; = the amount claimed by the 7th individual; a; = the 
amount the 7th individual actually had; and A = the total amount of 
money. Under an entitlement structure like the one sketched out 
above, the amount received by the 7th individual can be written as 


n=a—max{(1+afa—(4-Da)o} ae ee eer) 


jAt 


where A — 3 ,4;¢; is the difference between the total amount of money 
and the amount claimed by everyone else and is, hence, the ith 
individual’s maximum entitlement. It follows that a > 0 if individuals 
are to be penalized for claiming amounts greater than the maximum 
entitlement.’ 

The individual’s problem is to choose ¢; so as to maximize 7;. This, 
however, calls for an assumption about 2j,;¢;. The reasonable as- 
sumption here is that > iC; = 2 jij, as will be seen by the solution to 
the individual’s problem. If we substitute 24a, for Xj.:¢; and note 
that it follows by definition that 4d — Did h = awe. cant rewrite 
equation (1) as 


f,=¢, — max([(1 + a)(G — a,), OI. (2) 
LIfc, <A —Djeicj, then 7; = c; because (1 + ale; — (A — > j«iC;)] < 0. Also note that it 


is assumed that the amount of money that individuals are penalized is kept by the 
captain. 
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From equation (2), it is easy to see that r; = a; if c; = a; but that r; < a; if 
c; 2 a;, and as a result, the individual honestly reveals and receives the 
amount of money he actually had. Furthermore, this is true for all z, 
which means that the assumption of 2 j4:¢; = 2ji@; is validated and 
we have a Nash equilibrium solution for our problem. 

This solution is obviously superior to the captain’s solution because 
it returns to all individuals the amounts of money they actually had, 
regardless of what the actual amounts were. This depends on obtain- 
ing correct information, which is accomplished by eliminating the 
direct link between individuals’ claims and their entitlements. It is that 
direct link which gives rise to the incentive for wrong information. 


III. Comparison with the Demand-revealing Process and 
Conclusion 


It was stated in the introduction that the superior solution to Captain 
MacWhirr’s problem has a certain similarity with the demand- 
revealing process for determining preferences for public goods. We 
are now in a position to see what this similarity consists of. In brief, the 
similarity is that both are solutions to information problems that are 
obtained by separating the information revealed by an individual 
from the individual’s entitlement. 

The classic public goods problem is that the free-rider phenome- 
non generates an incentive for individuals to understate their valua- 
tions of a public good if their share of the financing depends on that 
valuation. That is, if individuals feel that their own share of total 
financing is negligible and, hence, that they are quantity takers from 
the total collective decision, then they have an incentive to understate 
their demand for the good if their entitlement (obligation) to share in 
the financing is determined by the demand they reveal. The twist in 
the demand-revealing process that causes individuals to accurately 
reveal their demands is that the entitlement structure is changed from 
the one that is usually implicitly assumed. Instead of obligating the 
individual to share only in the financing of the stated quantity desired 
at a given tax price, the individual is obligated, rather, to share in the 
financing, at a given tax price, of the quantity chosen by everyone else. 
This simple change in entitlement basically eliminates the possibility 
of a free ride. Then, by using suitably designed penalties for deviating 


> This is not the case with other assumptions about 2 ;;c; and, hence, the justification 
for the assumption used. That is, if it is initially assumed that 2 j,;c; > 2 j:a;, this means 
that 7; is maximized with c; < a;. But, if each person knows that everyone else has the 
same incentive to act in a similar fashion, this would lead to the conclusion that 3 j4;¢ ; < 
> j«i@;, which contradicts the initial assumption. In the same way, the initial assumption 
of SjiC ; < %j«a; also leads to a contradiction. 
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from the basic entitlement given by everyone else, the individual has 
the incentive to reveal his true demand for the good. Because 
everyone thus has incentive to reveal his true demand, the total 
demand for the good is honestly revealed and the appropriate quan- 
tity can be chosen. The basic feature of this process is strikingly 
similar to the basic feature of the superior solution to Captain Mac- 
Whirr’s problem. 

The point here is not that there is necessarily a large class of 
economics problems that correspond to the fictional problem posed 
by Conrad and susceptible to solution by the method developed above 
in Section II. Rather, the fictional problem nicely illustrates that the 
apparent incompatibility between economic incentives and revealing 
true information does not mean that the information problem is 
insoluble. On the contrary, it indicates that a suitable entitlement 
structure restores the compatibility between incentives and true in- 
formation, thus allowing the information problem to be solved. 
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Concentration Changes and Inflation: 
Some Evidence 


E. Woodrow Eckard, Jr. 


General Motors Corporation 


Ina recent article in this Journal, Gisser and Johnson (1979; hereafter 
cited as G & J) examine the relationship between industrial concentra- 
tion changes and inflation. They develop an algebraic model relating 
concentration levels to an economy-wide price index, make assump- 
tions regarding parameter values, and then use the model to simulate 
a relationship between concentration changes and inflation. The cen- 
tral feature of the model is a Cournot-type specification of the re- 
lationship between concentration (or number of firms) and industry 
price level.’ Naturally enough, this produces a positive relationship 
between concentration and the price index and, by extension, be- 
tween concentration changes and index changes.? However, the re- 
lationship is weak in the usually observed concentration ranges, and 
so the authors conclude that “the act of reducing concentration, if 
successful, is likely to have little impact on the consumer price index” 
(p. 21377). 

The purpose of this note is to present some empirical evidence 
regarding this issue. In particular, do the facts support the positive, if 


Helpful comments from T. G. Marx and T. F. Walton are gratefully acknowledged. 
Any remaining errors and all opinions expressed herein are the sole responsibility of 
the author. 

* Rather than using, say, a four-firm concentration ratio, G & J (1979, p. 1378) assume 
each industry contains some number of identical firms. Concentration is therefore 
measured as the inverse of this number. 

* See Stigler (1968, pp. 36-37) for a description of the Cournot model. However, its 
theoretical validity, as applied to oligopoly pricing, has been seriously undermined (see, 
e.g., Stigler 1968, pp. 36-37; Scherer 1970, p. 135). The continued use of this 
model in industrial organization research appears to be due solely to its mathematical 
tractability. 
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weak, relationship predicted by the G & J simulation? First, a measure 
of the amount of inflation which can be “associated” with an indi- 
vidual industry is developed. Next, using Census of Manufactures 
data, this measure is calculated for a sample of four-digit industries 
over three time periods and then correlated with the corresponding 
changes in industry concentration.? While average economy-wide 
concentration has been relatively stable in recent years (as noted by 
G & J), concentration ratios for individual industries have varied 
sufficiently for identification purposes. The results indicate that con- 
centration changes are unrelated to inflation. 


An Inflation Measure 


This section develops an empirical inflation measure based on the 
theoretical model used by G & J.* In their model, G & J assume a two- 
good economy (X and Y) with a single factor of production and no 
intermediate goods. They could therefore write money income (J) as 
follows: 


T= pede + Pudy = V Sa + VSy, (1) 


where p is unit price, g is number of units, and VS is value of ship- 
ments. However, in a more general model, intermediate shipments 
occur. Therefore, to avoid “double counting,” money income must be 
written as a function of value added: 


N 


12S 13 VA, (2) 
i=1 


i=1 


where y; is unit value added (i.e., prime factor cost per unit) and V4; is 
total value added for the 7th of N goods. The use of unit value added, 
rather than price, is important in the present context because it 
eliminates the effect of (nonprime factor) input prices on the 
industry-specific inflation measure to be developed below. These 
prices are assumed to be taken as parameters by firms in any particu- 
lar industry. 

A Lespeyres-type “inflation index” can now be defined as follows: 


3 The logic of the market power-inflation relationship implies that the correct em- 
pirical specification is between inflation and concentration changes rather than con- 
centration levels. For a discussion of this point, see Stigler (1962, pp. 8-9). 

4 The inflation measure developed here is similar to that of Brainard and Lovell 
(1966, pp. 860-61). For a discussion of this measure in the context of a simple 
monetary model of inflation, see Eckard (1978, chap. 2). 
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N 
1,0 
> vid 


L=——= a (3) 
> va? 


where superscripts 1 and 0 refer to current and base years, respec- 
tively, and 'J° is a hypothetical nominal income calculated at current- 
year unit value addeds and base-year quantities.? As defined, 'J° is 
unobservable. However, this can be remedied via some algebraic 
manipulation. First, rewrite equation (3) as follows: 





Defining inflation as the change in L during a given period (AL = 
L' — L®), specifying L° = 1, and recalling equation (2) yields: 





‘(ut \ VAS =1 (4) 
+ » ( 9 \ar- EET son 
faa 





This equation states that a given amount of inflation can be par- 
titioned into components associated with each of N elements com- 
prising the economy. Each component is the proportionate increase 
in that element’s unit value added, weighted by its base-year. total 
value added as a fraction of total income. 

Abstracting from the effect of relative size (as represented by 
VAS/I°), the amount of inflation associated with each element is mea- 


° Note that this index overstates the “true” price-level change to the extent that (1) the 
increase in unit value added corresponds to a quality increase and (2) the output vector 


in period | differs from the initial period vector due to consumer responses to relative 
price changes. 
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sured by the first term inside the parentheses of equation (4). For 
convenience, this ratio is defined as follows: 
1 
U; 
RS 5 
er (5) 
The above is, of course, analogous to a change in average unit price. 


However, the confounding effects of price changes in inputs pur- 
chased from other industries are absent. 


Empirical Results 


In this section, R; is calculated for a set of “concentrated” four-digit 
SIC industries using Census of Manufactures data. This inflation 
measure is then correlated with concentration changes. But first it is 
necessary to convert equation (5) to a form which can be calculated 
from available data. This is accomplished by recalling from equation 
(2) that v; = VA;/q;; therefore: 


VAt/q} 
VA9/q? 


_ VA'VAY 
qilq 


Value added is available from the Census of Manufactures four-digit 
industry data.® While q; is not available, a proxy for the ratio g/q? can 
be obtained for certain periods from the Census of Manufactures 
Indexes of Production. These indexes are estimates of the change in 
physical output calculated at the four-digit industry level. 

Based on these data, R; is calculated for the time periods 1958-63, 
1963-67, and 1967-72.’ To restrict the sample to “oligopolies,” only 
those industries with four-firm concentration ratios (CR4’s) equal to 
50 percent or greater at either the beginning or the end of a period 
are included.’ The data for the 3 periods are then pooled. To nor- 


R; = 
(6) 


6 The census value-added data are at best a rough proxy for “true” value added, i.e., 
the amount of national income originating in each industry. In fact, census value added 
for all manufacturing industries has consistently overestimated national income 
originating in manufacturing by roughly one-third (see U.S. Department of Commerce 
1972, pp. xxxii-xxxiv). Because this error has been fairly consistent over the period 
covered by the study, we assume that the change in census value added is an unbiased 
estimate of the change in the “true” value added for each industry. 

7 These periods were stipulated by data availability. 

8 The selection of the 50 percent cutoff point is, of course, arbitrary. Its purpose is to 
avoid a possible bias by excluding those industries which few would argue possess the 
market power required for the adverse pricing effect postulated under the 
concentration-inflation hypothesis. The industry concentration ratios used are those 
published by the Census of Manufactures. Their shortcomings as measures of market 
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TABLE 1 


AVERAGE INDUSTRY INFLATION (R;) BY CHANGE IN CONCENTRATION FOR THREE POOLED 
Time Periops (1958-63, 1963-67, and 1967-72) 














Number of Average Standard 

Changes in CR4 Industries R; (< 100) Error 
i Omen cyiya > ee: 10 88.5 3.4 
=6 tor LOT ean. oe eee 32 91.4 Qa 
SS NetOy= 5) Fane ferment 76 93.1 es 
OE ESS eesrat eas 30 92.3 ied, 
tg ares Sees 85 95.9 9 
Gitomer ih 91.2 2.6 
> ean Ole een eee ae 14 89.5 3.6 
Mota rs aie eee cee 274 93.1 nL 





malize the observations for different “underlying” inflation rates in 
these periods, each R; is divided by the increase in the implicit price 
deflator for the appropriate period.® Finally, the data are grouped by 
the change in CR4. 

The results are presented in table 1 (note that the R;s are multi- 
plied by 100). The data for falling CR4’s appear to support the 
hypothesis that a positive relationship exists between CR4 changes 
and inflation.’° Average R; (x 100) declines from 93.1, for CR4 
reductions of 1—5 percent, to 88.5 for CR4 reductions of greater than 
10 percent. However, the data for rising CR4’s do not support the 
hypothesis. For CR4 increases of 1—5 percent, average R; (x 100) is 
95.9, while for CR4 increases of 6-10 percent and greater than 10 
percent, R; (x 100) declines to 91.2 and 89.5, respectively. Thus, R; 
appears to be more closely related to the absolute value of CR4 
change rather than to the direction of change. 

This is made clearer in table 2, where the data of table 1 are 
retabulated by absolute value of CR4 change. For industries with CR4 
changes greater than 10 percent in absolute value, average R; (x 100) 


concentration have been well documented (see, e.g., Bock 1972, pp. 11-30). These data 
are used here because they have come to dominate empirical thinking on concentra- 
tion, a development best explained by (1) their accessibility and (2) their corre- 
Pee to the mass of other data published by the census at the four-digit industry 
level. 

* The average industry concentration for the universe of four-digit SIC industries 
has remained essentially constant between 1958 and 1972 (see, e.g., Allen 1976, p. 665, 
table 1). Therefore, the inflation rates occurring in each of the three periods must have 
been due to other (i.e., monetary) phenomena. Removing the effects of these different 
“underlying” rates eliminates one major source of variance in the pooled data. 

* The simple correlation between R; and concentration change for all 274 obser- 


vations is positive although small in magnitude (r = .024) and statistically insignificant 
(t = .4). 
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TABLE 2 


AVERAGE INDUSTRY INFLATION (R;) BY ABSOLUTE VALUE OF CHANGE IN CONCENTRATION 
FOR THREE PooLepD TIME Pertops (1958-63, 1963-67, and 1967-72) 














Absolute Value of Number of Average Standard 
Changes in CR4 Industries R; (< 100) Error 
PO ree ie Strsneeytncts tine y sob waKberg-s 24 89.1 2.5 
Oa ame eter rR yer tc caps Sec cas 59 GIES 159) 
NR OM eases ope eae Ft eres goer at anvn ee 161 94.5 8 
ORs ate ir en eee 30 3 57/ 
UO UAE ee Sage chores Ss iotereyeesya es lsic Si soak 274 301 





equals 89.1. For industries with absolute concentration changes in the 
1-5 percent range, average R; (x 100) rises to 94.5.1! While the 
average R; (x 100) for the 30 industries with no change in CR¢4 is less 
than that for the 1—5 percent change group, it is still greater than the 
average R; (x 100) for 6 percent and greater CR4 change groups. 
Stated alternatively, for the 83 observations with CR4 changes greater 
in absolute value than 5 percent, average R; (x 100) equals 90.6. For 
the 191 observations with CR4 changes of 5 percent or less in absolute 
value, average R; (X 100) is 94.2. 

What might account for this pattern? One explanation is that com- 
petition in concentrated industries (as well as others) often takes the 
form of cost reductions through productivity increases. Since payroll 
costs are a substantial portion of value added, productivity increases 
would be discernible as reductions in unit value added, ceteris 
paribus. If such cost savings occur among one or more of the top four 
firms, CR4 rises as the firm (or firms) introducing the cost reduction 
enjoys a relative increase in sales. If a cost reduction occurs among the 
smaller firms, CR4 in general falls.1* However, in either case average 
industry costs fall. The data in tables 1 and 2 may therefore merely 
reflect a tendency for firms introducing greater cost reductions to also 
have greater sales increases, that is, to induce greater CR4 changes. 


4 'The simple correlation between R; and the absolute value of concentration change 
for all 274 observations is negative (r = —.18) and statistically significant at the 1 percent 
level. 

2 The exception occurs if (1) the sales increase of the smaller firm is sufficient to 
place it among the top four and (2) the CR4 of the new top four is not less than that of 
the old. Given the relatively short time periods analyzed (4—5 years), such exceptions 
are probably infrequent. 

8 Peltzman (1977) has examined the relationship between unit cost changes and 
concentration changes using a sample drawn from the same universe of census four- 
digit industries. He found that both concentration increases and decreases are corre- 
lated with cost decreases. However, for concentration decreases, the correlation is 
“statistically insignificant and only a fraction of the effect of similar increases” (p. 245). 
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One final observation is in order. Since each individual value of R; is 
corrected for the appropriate “underlying” inflation rate, an R; (x 
100) of 100 indicates a “contribution” to the inflation rate equal to that 
of the economy as a whole, that is, equal to the underlying rate. But, 
as indicated in table 1, the average R; (X 100) for all 274 observations 
is only 93.1. Thus, the “concentrated” industries in the Census four- 
digit industry universe, for the given periods, performed significantly 
better than the remainder of the economy." 


Conclusions 


In short, the evidence suggests that a policy of increasing concentra- 
tion would be as effective in easing inflation as a policy of decreasing 
concentration, which is to say, not very effective at all. That is because 
the concentration-inflation hypothesis which formed the basis for the 
G & J simulation has the direction of causality between concentration 
changes and price changes reversed. Concentration changes are in- 
duced by corresponding changes in the relative competitive strengths 
of firms in an industry as manifested by (among other things) price 
changes, not vice versa. In fact, the evidence suggests that neither 
increasing nor relatively high concentration is associated with “bad” 
inflationary performance. Policymakers will have to look elsewhere 
for a cure for inflation. 
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The Crisis in Keynesian Economics. By SiR JOHN R. Hicks. 
Oxford: Oxford University Press, 1974. Pp. 88. £1.75. 


My generation accepted the message conveyed by the General Theory with 
remarkable agility. The analytic interpretation of the message offered by Sir 
John Hicks effectively contributed to the victorious sweep. But the prospects 
of a “New Economics” faded within the first 2 decades of the postwar period. 
The promise of activist stabilization policies failed and eventually yielded a 
reality dominated by an apparently intractable inflation, protracted unem- 
ployment, and volatile exchange rates. Analytic reflections increasingly ques- 
tioned, moreover, the “Keynesian resolution” of the intellectual dilemma 
posed by the experiences of the interwar period. These experiences seemed 
incompatible with inherited price theory. They revealed what appeared to be 
an immanent flaw of the price mechanism. Two options were available: One 
would reformulate traditional price theory in order to subsume these experi- 
ences within an extended framework, and the other would abandon price 
theory and replace it with “sociology” or “social psychology.” Keynes chose 
essentially the second option. This was particularly reflected in the treatment 
of the labor market. It was also reflected in his approach to long-run expecta- 
tions and their relation to investment expenditures (remember the “animal 
spirits”). This influence pervades the Keynesian establishment and affects 
much of the “post-Keynesian” literature and the post-Keynesian approach to 
the inflation problem. 

The evolution of monetary theory over the postwar decades may be inter- 
preted to result from a systematic reconsideration of the Keynesian resolution 
of the intellectual dilemma. It was increasingly argued that an intellectually 
more productive resolution would continue a viable “classical program.” This 
program is centered on an extended price theory incorporating pervasive 
information problems but would discard the forms of the “neoclassic syn- 
thesis” successfully established during the early postwar phase. This back- 
ground of our current struggles attaches particular significance to the 
comments on “the crisis in Keynesian economics” offered by Hicks to the 
profession. 

His material is organized into three chapters: the first covering “the mul- 
tiplier”; the second attending to money, interest, and liquidity; and the last 
exploring wages and the labor market. The chapter on the multiplier exam- 
ines issues badly neglected in the traditional Keynesian literature. Hicks 
approaches in a somewhat implicit and indirect manner two aspects associated 
with supply behavior in the context of macroanalysis. The first aspect con- 
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cerns the role of stocks (i.e., inventories) in the multiplier process. The 
argument elaborates the importance of initial conditions expressed by the 
inherited pattern of stocks for the magnitude and persistence of the multi- 
plier response to variations in autonomous expenditures. The reader of these 
passages can hardly avoid wondering about the usefulness of activist fiscal or 
monetary policy against the background of this chapter. 

The other aspect of supply behavior involves the determination of prices. 
Hicks elaborates his previous differentiation between “fixprice” and “flex- 
price” market systems. He notes that the traditional multiplier analysis pre- 
supposes a fixprice system. The discussion of this problem offers some 
thoughtful and suggestive remarks to the reader. But it also contains linger- 
ing residues of the Keynesian resolution. This appears most particularly in 
the characterization of the distinct market systems. Flexprices are said to 
reflect the operation of supply and demand, whereas fixprice patterns move 
beyond any exposure to market forces. We also read that flexprices are 
formed by the market and fixprices are set by suppliers (or demanders). This 
description is quite misleading. All prices are set, and all reflect market forces. 
The issue centers on the relative frequency of “resetting” and the nature of 
the market forces shaping the price setting. Several explanations emerged in 
recent years to account for the relative inflexibilities of prices as a rational 
market response to specifiable uncertainties and transaction costs. One of the 
approaches emphasizes that prices are geared in the context of substantial 
transaction costs to perceived “permanent” market conditions. Price setting 
disregards under these conditions transitory changes in prevailing conditions. 
Changes in permanent conditions are moreover perceived only slowly, and 
prices are thus inflexible relative to the actual but insufficiently perceived 
underlying market conditions. Such extensions of price theory avoid the 
Keynesian resolution drifting into ad hoc arguments without a coherent 
analytic framework. 

The second chapter bears on aspects of the Keynesian transmission mecha- 
nism and the role of liquidity. The remarks on the latter point are closely 
related to an analysis which emphasizes that costs of information and transac- 
tion differ widely over the spectrum of assets. This variance assures the 
emergence of “transaction-dominating assets.” Future opportunities depend 
under the circumstances, as Hicks correctly elaborates, on current portfolio 
compositions. Moreover, the author argues cogently for a wider spectrum of 
substitution relations surrounding money beyond the narrow choice of 
“money versus bonds.” He still seems to confine, however, the transmission 
channels to returns on financial assets. The “City Syndrome,” so prevalent in 
the Keynesian traditions, finds it apparently difficult to comprehend that in 
most countries monetary impulses are dominantly transmitted, in the absence 
of developed capital markets, via substitutions between money and goods or 
real assets. Money as the transaction-dominating asset substitutes at the mar- 
gin in all directions. 

The last chapter is particularly informative and suggestive. Hicks discusses 
the lack of a wage theory in the Keynesian tradition. The reader also infers 
that Keynes’s “wage theorem” opened the gate to a post-Keynesian flood of 
inflation theories based on more or less explicit denial of economic analysis. 
Hicks emphasizes in particular the need for an endogenous incorporation of 
wage push. The formulation couched in terms of “economic” and 
“noneconomic” causes is singularly misleading, however. On the other hand, 
the attention given by Hicks to the comparative durability of market relations 
seems particularly promising for a useful approach to the explanation of 
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wage behavior. Such reasoning would link the labor market phenomena with 
evolutions on the output market indicated by the fixprice patterns. The 
allusions to “fairness” considerations in the wage process seem redundant, 
however. They have, as they stand, no basis in analysis and add nothing to the 
substance of Hicks’s important suggestions. 

The remarks of a thoughtful professional are always worth pondering, and 
Sir John Hicks certainly deserves our attention. But I wonder about his 
conclusion that “reformulated Keynes is much more like Keynes than it is like 
the cruder forms of neo-classical doctrine.” I wonder about this statement as 
intriguing suggestions strewn along the text lead us, as some analytic de- 
velopments would appear to do, beyond “Keynes” (reformulated or not) and 
also beyond “neo-classical exercises,” crude or sophisticated. 


KARL BRUNNER 
University of Rochester 


The Zero-Sum Society: Distribution and the Possibilities for Economic Change. By 
LesTER C. THuROw. 

New York: Basic Books, 1980. Pp. 230. $12.95 (cloth). New York: Penguin Books, 
1980. $4.95 (paper). 


This is a book about the economy of the United States, but it could be about 
almost any other Western developed economy. With perhaps one or two 
exceptions the advanced countries of the Western world face the same eco- 
nomic problems and have already several years’ experience of high unem- 
ployment, slow growth, and rapid inflation. They all—with one exception— 
face special problems arising out of the increased price of oil (the exception, 
Great Britain, has managed to have its own special problems as a result of not 
having an oil problem and having, instead, a deindustrialization problem 
caused partly by a high exchange rate). In many cases the problems that 
Thurow enumerates apply even more acutely to other countries. Some of 
them are even more dependent on imported oil than is the United States; 
most of them protect their ailing firms or industries more than does the 
United States; and most of them spend higher shares of their national prod- 
uct on welfare payments than does the United States. But give or take a 
problem or two, the picture he paints of the current economic performance 
of the United States is very familiar in other Western countries, and his 
diagnosis probably applies in large measure to the other countries as well. 

The main theme of the book is that the economic problems facing the 
American economy today are solvable but only at the cost of somebody losing 
in the process, and it is almost impossible these days to make the required 
losses stick on anybody. There are no longer powerless groups in society to 
whom the burdens of any economic change can ultimately be passed. Civil 
rights movements, black power, women’s lib, poverty lobbies, and so on have 
become more militant and effective. In addition, there has been increasing 
Balkanization of the country, and individual states are increasingly reluctant 
to make sacrifices in the wider national interest—for example, to be the state 
where the nuclear power stations are to be sited or where the open-cast coal is 
to be mined. 

As Thurow correctly points out, virtually any economic policy decision will 
affect the distribution of income, and those groups who are likely to be 
affected are now quick to recognize the threat and to take effective defensive 
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action. Until recently, changes in the distribution of income between groups 
that resulted from changes in the economic structure or from government 
policy measures could be accommodated within a growing economy. Now, 
however, economic growth has dramatically slowed down, so that we are 
playing a zero-sum game. Those who lose out relatively to the others also lose 
out absolutely, and this is more than can be tolerated within the present 
balance of political power. 

Thurow then goes on to show how this general theme applies to some of the 
major economic problems facing the U.S. economy today. He begins with the 
energy problem and explains how the free market solution—letting the price 
of oil rise to world levels—means a large cut in real incomes of certain sections 
of the population, particularly the poorest. Furthermore, although inflation 
usually only redistributes income among groups within the economy and does 
not necessarily impoverish the economy as a whole, it is still greatly resented 
so that the overall inflationary impact of rising energy prices has to be taken 
into account as well. 

And other measures to alleviate the energy problem—notably more resort 
to some of the more economically viable energy alternatives such as open-cast 
coal mining or nuclear power—are opposed by special local interests who are 
able effectively to delay projects to the point that they kill them off. And, 
anyway, speculative investment in alternative forms of energy is not attractive 
to private industry since the high price of imported oil is the result of a cartel, 
not of corresponding high costs of production. So if some alternative form of 
energy could be produced that was competitive with oil at, say, $30 per barrel, 
the oil producers could just cut their price, while still making fat profits, and 
put the new competitor out of business. Hence, large-scale investment in the 
search for alternative energy sources must rely largely on the government one 
way or another. But that means more taxes, and we are all against them. 
Moral: Whatever way one turns, there seems to be no effective solution in 
sight to the energy problem without some group or other getting hurt, and all 
the potential victims are now able to protect themselves. 

As regards the problem of inflation, Thurow’s analysis follows that of many 
other commentators and is, in a sense, somewhat inconsistent with much of 
his basic theme. He spells out the widespread view that inflation is largely the 
result of certain key groups in society being able to protect themselves against 
cuts in their real incomes. Since prices and wages are flexible upward but not 
downward, random changes in various prices and costs will have an upward 
ratchet effect on the overall price level. 

All this is familiar, but, in addition, Thurow introduces into the story an 
analysis of the labor market which is more original and is the one he has set 
out in more detail in an earlier book (Generating Inequality) and which demon- 
strates why a cut in demand and an increase in unemployment is likely to have 
a negligible effect on the rate of inflation. (This is very much in line with many 
of the econometric macro models on the market these days—not that 
confirmation from current econometric models is necessarily much comfort 
to anybody.) 

But a major conclusion of Thurow’s analysis is that the burden of futile 
restrictionist monetary or fiscal policies falls mainly on those who are already 
the weakest economic groups in the economy—such as young people, 
women, or ethnic minorities. As he puts it: “If a recession is to be used to stop 
inflation, we draft inflation fighters in a very uneven pattern. In proportion to 
their size in the labor force, sixteen to twenty-four-year-olds are three times as 
likely to be drafted as adults. Females are 38 percent more likely to be drafted 
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than males; blacks are twice as likely to be drafted as whites; and Hispanics are 
75 percent more likely to be drafted than whites” (p. 65). 

However, in this part of the book Thurow seems to be arguing that society 
does contain quite a sizable number of “fall guys” who can bear the burden of 
whatever particular policy society adopts. Hence, what Thurow needs to do in 
order to ensure greater consistency with his basic theme Is to show that the fall 
guys in the inflation case cannot be made to bear the burden of the adjust- 
ment that would be needed in order to solve the other problems. This might 
not be difficult to do—for example, a rise in gasoline prices would hit white 
males aged 25-64 as much as, if not more than, automobile owners in other 
age, sex, or ethnic groups. 

Thurow goes on to discuss other current issues, some of which are clear 
applications of the zero-sum game theme, such as the environmentalist pres- 
sures which really involve special pleading by the middle classes to protect 
those aspects of their standards of life that have moved up in the ranking of 
their priorities and that can only be fully satisfied at the cost of sacrifices by 
the poorer groups of the population. 

There is also a connection between the zero-sum game theme and 
Thurow’s discussion of the reasons for the slowdown in the rate of growth in 
productivity in America during the last decade. For he attributes a large part 
of the slowdown to government policies that prop up ailing firms or industries 
in one way or another, again because of the effectiveness of political pressures 
to ensure that nobody suffers from economic change—at least as long as the 
potential victims are identified in terms of firms, or industries, or cities, or 
regions, and not in terms of groups of individuals, such as young people or 
ethnic minorities, for whom the old American virtues of self-reliance and 
individual initiative must be preserved and protected against the insidious 
effects of income support schemes. 

Throughout the book Thurow repeatedly refers to the overriding impor- 
tance of society achieving some sort of consensus as to the criteria that should 
be used to decide which groups should lose out as a result of some policy, and 
by how much and in what circumstances, and so on. He provides some idea of 
what sort of criteria of distributive justice might be reasonable, one of the 
most important of which is that the earnings dispersion of all groups in the 
economy should be no more unequal than it is now for fully employed white 
males, who, he shows, have a much smaller dispersion than other groups. 
Presumably, he would also want to change the relationship between the mean 
earnings of white males and of these other groups, since the latter might not 
complain about having higher dispersion if their means were twice as high 
and, conversely, they might not be all that pleased to have zero dispersion if 
their mean was 10 percent of that of white males. 

There are, first of all, some technical objections to Thurow’s rather casual 
use of earnings dispersions for welfare purposes, such as the fact that earners 
are less than half the population, or that earnings may not be highly corre- 
lated with family incomes per capita (a measure that Thurow uses in a 
different context, although adult equivalent units would have been a better 
deflator). But the big question is how to achieve the desired dispersion. 
Thurow advocates some kind of governmental guaranteed job program—not 
at minimum wages but at the wages appropriate to the skills and so on of the 
people in question. However, the precise functioning of this program is not 
spelled out in much detail. 

One is likely to be equally skeptical as to the chances of society reaching 
some sort of consensus on the various distributive objectives that Thurow 
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enumerates. Distributional criteria have been the subject of debate since 
Aristotle (if not earlier), who first introduced the concept of “distributive 
Justice” and the notion that people should be rewarded in proportion to their 
merits. However, as Aristotle pointed out, there is usually much disagreement 
as to what constitutes a merit. And in the following 2,000 years or more, not 
much progress has been made in sorting it all out to everybody’s satisfaction. 
Just appointing a couple of philosophers to settle the matter for us will not 
help; there is a chance that the two appointed would be Rawls and Nozick. 

Governments have to obtain some idea—however vaguely—as to what the 
public might, on the whole, prefer and, in the light also of other considera- 
tions of the national interest, act accordingly. If, of course, one is saddled with 
a government that is incapable of acting since its leaders want to be loved by 
everybody, it is not surprising that no clear distributive policy is ever adopted. 

Of course, Thurow recognizes at one point that, perhaps, he is being naive 
in supposing that society can solve its problems, and he recognizes that “in the 
records of history, we certainly would not be the first society that failed to 
come to grips with its fundamental internal problems” (p. 207). Of course, 
problems that seem insoluble at any particular point of time—in the sense 
that the theoretical solutions are not feasible—sometimes become soluble with 
the passage of time. The actors in the drama may change or learn new lines. 
Thurow’s message might actually get through to some of them. He has an 
extraordinary gift for clarity and simplicity of exposition, illuminated from 
time to time with striking phrases, without being technically inaccurate. Such 
inaccuracies as I have spotted have not been the result of oversimplification 
but of occasional lapses into carelessness, such as his incorrect attribution to 
two authors of an article by only one of them (Malcolm Sawyer), or his failure 
to explain why at one point (p. 11) he says that “welfare” (i.e., income support 
payments) constitutes only 1.2 percent of GNP in the United States, whereas 
at another point (pp. 155 and 156) he says that redistributive transfer pay- 
ments amount to over 10 percent of GNP. 

On the whole the diagnosis is probably fairly mainstream though Thurow 
does well to bring out the zero-sum game element in the situation and their 
distributional consequences, as well as drawing all the strands together in a 
more coherent and authoritative exposition than is easily available elsewhere. 
It is his policy prescription that is the least convincing part of the book. 
Perhaps there is no solution and never will be. So perhaps we just have to 
suffer pseudo-controlled experiments, such as the current experiment in 
testing the British economy to destruction. It remains to be seen whether the 
U.S. constitutional setup can prevent its being subjected to similar experi- 
ments. 


WILFRED BECKERMAN 
Balliol College, Oxford 
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An Empirical Model of Labor Supply 
in a Life-Cycle Setting 





Thomas E. MaCurdy 


Stanford University and National Bureau of Economic Research 


This paper formulates and estimates a structural intertemporal 
model of labor supply. Using theoretical characterizations derived 
from an economic model of lifetime behavior, a two-step empirical 
analysis yields estimates of intertemporal and uncompensated sub- 
stitution effects which provide the information needed to predict the 
response of hours of work to life-cycle wage growth and shifts in the 
lifetime wage path. 


Introduction 


Over the past several years there has been considerable activity in 
formulating life-cycle models of labor supply. Most of this work has 
gone unnoticed in the empirical literature. This study develops an 
estimable model of labor supply that fully incorporates life-cycle fac- 
tors, and it devises simple econometric procedures for estimating this 
model. 

Most empirical work on labor supply assumes decision making 1n a 
one-period context. Typically, annual hours of work are regressed on 
the current hourly wage rate and some measure of property income. 
A worker, however, determines his current labor supply in a life-cycle 
setting. Unless credit markets are “perfectly imperfect” and there is 
no human capital accumulation, the supply of labor is a function of 
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current and future discounted wage rates as well as wealth and con- 
straints in other periods. Accordingly, regressions of hours of work 
on current hourly wage rates yield a wage coefficient that confuses the 
response of labor supply to wage changes of three types: those arising 
from movements along a given lifetime wage profile, those arising 
from shifts in the wage profile, and those arising from changes in the 
profile slope. As a result, the wage coefficient usually reported in 
empirical studies has no behavioral interpretation in the context of a 
life-cycle framework. 

The theory underlying the model of lifetime consumption and 
hours of work in this paper represents a natural extension of Fried- 
man’s (1957) permanent income theory to a situation in which the 
relative price of consumption and leisure varies over the life cycle. 
Theoretical characterizations of consumption and labor supply de- 
rived from this theory sharply distinguish between factors determin- 
ing a consumer’s dynamic behavior and factors determining dif- 
ferences in consumption and hours of work across consumers. This 
separation leads to a manageable empirical model that allows one to 
discriminate the responses of labor supply to wage changes attribut- 
able to movements along a lifetime wage profile from those responses 
attributable to parametric changes in this profile. In addition, one can 
estimate the effects of wealth and demographic characteristics on 
lifetime hours of work. 

The organization of this paper is as follows. Section I outlines an 
economic model of life-cycle behavior. Section II develops and dis- 
cusses an empirical model of labor supply. Section III interprets the 
parameters of this empirical model. Section IV contains the empirical 
analysis. 


I. A Life-Cycle Model of Consumption and Labor Supply 


The consumer is assumed to choose consumption and hours of leisure 
at each age to maximize a lifetime preference function that is strongly 
separable over time, subject to a wealth constraint. Let utility at age t 
be given by the concave function U[C(t), L(t)], where C(t) is the 
amount of market goods consumed and L(t) is the number of hours 
spent in nonmarket activities at age t. The consumer starts life with 
assets 4(0) and operates in an environment of perfect certainty. At 
each age ¢ he faces a real wage rate equal to W(t) assumed to be 
exogenously given. The consumer can freely borrow and lend at a 
real rate of interest equal to r(t) in period ¢, and his rate of time 
preference is p. A lifetime is assumed to consist of T + 1 periods with 
L* being the total number of hours in each period. 
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Formally, the consumer’s problem is to choose C(t) and L(t) at each 
age to maximize the lifetime preference function 


oat 
ofS (1 + py U[C(t), Li (1) 


t=0 


subject to the wealth constraint 
ih T 
A(0) + RON (W(t) = YRC, (2) 
t=0 t=0 


where G(-) is a monotonically increasing function, N(t) =L* — L(t) is 
hours of work at age t, and R(t) = 1/{[1 +7(1)][1 + r(2)]...{1 + r(t)]} 
is the discount rate which converts real income in period ¢ into its 
period 0 equivalent with R(0) = 1. 

Conditions for an optimum are satisfaction of the lifetime budget 
constraint and 


U{C(t), L()] =R((1 + p)’r, t=0,...,T, (3) 
U{C(t), L()])=R((1 + p)AW(t), t=0,...,T, (4) 


where subscripts denote partial derivatives and X is defined by A = 
A*/G’, where A* is the Lagrange multiplier associated with the wealth 
constraint (i.e., A* is the marginal utility of wealth in period Q) and G' 
is the derivative of G. According to condition (3), consumption is 
chosen so that the marginal utility of consumption equals the margi- 
nal utility of wealth after adjusting for a discount factor which de- 
pends on the rate of time preference and the rate of interest. Condi- 
tion (4) determines the consumer’s choice of leisure. If it is an 
equality, then a positive amount of labor is supplied to the market. If 
it is a strict inequality, then all time is devoted to nonmarket activities. 

Using the definition of labor supply (i.e., N[¢] =L* — L[t]) and the 
implicit-function theorem, it is possible to solve equations (3) and (4) 
for consumption and labor supply as functions of the form 


CH SCIRO1+ pr, WO) £=0,...,T, (5) 
Ni =NIR (OG pa, Wi), t= 0,-4.7. (6) 


The functions C(-,-) and N(-,-) depend only on the functional form 
of U(-,-). As a consequence of concavity of U and the assumption that 
consumption and leisure are normal goods, they satisfy 


C200 NS 0 (7) 
1 See Heckman (1974, 1976) for proofs of these inequalities. Heckman develops and 


uses functions equivalent to those given by eqq. (5) and (6) in his analysis of the behavior 
of consumption and labor supply over the life cycle. 
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These consumption and labor-supply functions allow for corner so- 
lutions for hours of work either at age ¢ or at any other age t’. No 
matter what the consumer’s labor-force participation pattern over his 
lifetime, consumption and labor-supply decisions at any age (includ- 
ing the decision to set hours of work equal to zero) are completely 
determined by the functions C(-,-) and N(-,-) and the values of the 
variables R(t)(1 + p)'A and W(t).? 

The relationships given by (5) and (6) hereafter will be referred to 
as the “A constant” consumption and labor-supply functions. They 
represent the marginal utility of wealth constant demand functions 
for consumption and leisure for a particular form of the lifetime 
preference function given by (1), namely, the one obtained when G is 
the identity transformation. For this particular choice of G, \ is the 
marginal utility of wealth in period 0. Given a choice of U(-,-), it is 
theoretically possible to compute a unique value for \ using data on 
an individual’s consumption, labor supply, and wage rate at a point in 
time. This fact receives much attention in the formulation of the 
empirical model which is discussed in the next section. 

Substituting the A constant consumption and labor-supply func- 
tions into the budget constraint given by (2) yields the equation 


i 
AO) = S R(t) {CLR + pA, W(H)}. * 
t=0 


— WOINIRO(L + p)'r, WO}. 


This equation implicitly determines the optimal value of A; A, then, 
can be expressed as a function of initial assets, lifetime wages, interest 
rates, rates of time preference, and “consumer tastes.” Concavity of 
preferences implies 


On Or 
—— < ——_ = ea 2 
a4) <0 and aU Ce ee (9) 

* If it is optimal for the consumer to work at age ¢, then condition (4) is an equality 
and the functions C(-,-) and N(-,:) represent the solutions of eqq. (3) and (4) for the 
variables C(t) and N(t) = L* — L(8), respectively. If, on the other hand, the necessary 
condition given by (4) is an inequality, i.e., U, [C(t), L*] > R(t)(1 + p) W(t), then the 
consumer chooses not to work. In this case, N(-,-) = 0 and the function C(-,-) is the 
solution of the equation U ,[C(¢), L*] = R(t)(1 + p)*d for C(t). In either case, C(-,-) and 
N(-,-) only contain the variables R(¢)(1 + p)'s and W(t) as arguments and their 
functional form depends only on the form of the period ¢ utility function U(-,-). For 
further discussion on this issue see Heckman and MaCurdy (1980). Introducing age 
dependence into the utility function does not change any of this analysis. If the utility 
function at age ¢ is given by U[C(t), L(t), X(t)], where X(t) is a vector of time-varying 
determinants of “consumer tastes,” then a third argument X(t) enters the consumption 
and labor-supply functions given by (5) and (6). These functions satisfy the restrictions 
given by (7), and they also allow for corner solutions. 

°See Heckman (1974, 1976) for proof of these propositions. 
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Inspection of the A constant functions reveals that consumption and 
labor-supply decisions at a point in time are related to variables 
outside the decision period only through Xd. Thus, except for the value 
of the current wage rate, A summarizes all information about lifetime 
wages and property income that a consumer requires to determine his 
optimal current consumption and labor supply. At any age, any path 
of wages or property income over a consumer’s lifetime that keeps A 
and the current wage constant implies the same optimal current 
consumption and labor-supply behavior. 

The ) constant functions represent an extension of Friedman’s 
(1957) permanent income theory to a situation in which the relative 
price of consumption and leisure varies over the life cycle. According 
to these functions, current consumption and labor-supply decisions 
depend on a permanent component and the current wage rate. The 
variable X is like permanent income in the theory of the consumption 
function. At each point in time it is a sufficient statistic for all historic 
and future information about lifetime wages and property income 
that is relevant to the current choice of consumption and labor sup- 
ply. The usual concept of permanent income or wealth does not 
qualify as a sufficient statistic for this retrospective and prospective 
information.* Given knowledge of permanent income, a consumer 
also requires information on future wages to determine his optimal 
current consumption and labor supply. Only if wages are constant 
over the life cycle, or labor supply is exogenously determined, can A 
be written as a simple function of permanent income or wealth. 

The constant consumption and labor-supply functions fully 
characterize a consumer’s dynamic behavior in a world of perfect 
certainty.® According to these functions, there are two reasons why a 
consumer might change his consumption or hours of work as he ages: 
(1) the real wage rate changes, or (2) the rate of interest varies and is 
not equal to the rate of time preference.® 


4 Permanent income here is defined as that stream of income whose discounted value 
equals the consumer’s wealth in present value terms. Formally, permanent income in 
period 0, Y,, is defined by the equation 4(0) + Donley NE) VV (cma oh ee LUG) 

5 See MaCurdy (1978, 1980, in press) for a discussion of the uncertainty case. It is 
shown in these papers that with minor modifications the empirical specifications of 
labor supply developed in this section and their implementation in the following 
sections are consistent with a world in which the consumer is uncertain about his future 
lifetime path of wages and property income. 

6 Changes in a consumer’s tastes can also be a reason for adjustments in consumption 
and labor supply over the life cycle. As discussed in n. 2 above, if the period utility 
function is age dependent, the \ constant consumption and labor-supply functions will 
also be age dependent. It is still true, however, that the X constant functions fully 
characterize a consumer’s dynamic behavior. 
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II. An Empirical Model 


This section formulates an empirical model of labor supply that is 
based on the economic model described above. Specific functional 
forms are proposed for the d constant labor-supply function and for 
the relationship between ) and such variables as lifetime wages and 
initial assets. The following discussion assumes the availability of 
panel data. 


An Empirical Specification for the \ Constant 
Labor-Supply Function 


Assume that consumer 7 at age ¢ has utility given by 
UIC), L(t] = Yi (C(t) — Y, (HN AO ]°2, (10) 


where 0 < w, < 1 and w, > 1 are time-invariant parameters common 
across workers, and Y,,(t), Yo{t) > O are age-specific modifiers of 
“tastes.” The variables Y,,(¢) and Y>,(t) depend on all of consumer 2’s 
characteristics which plausibly affect his preferences at age t; these 
characteristics may include such variables as the number of children 
present at age ¢, the consumer’s education, and even age itself. 

Assuming an interior optimum,’ the implied \ constant labor- 
supply function for consumer i at age ¢ in natural logs is 





a 1 {In Ni oF In Y(t) alt) Wo 


@e 


(11) 
+ In[R()(1 + p) + InW,(e)}. 


Assuming that “tastes” for work are randomly distributed over the 
population according to the equation In Yo(t) = oj, — u¥(t), the 
labor-supply function can be written as 


InNd) =Fi + 8 > [p— rl + 8lnWK) + ud), (12) 
k=0 


where F; = [1/(@. — 1)](In d; — o; — In 2), 6 = 1/(@. — 1), ut) = 
du¥(t), (0) = p, and we have used the approximations In [1 + r(t)] ~ 
r(t) and In(1 + p) ~ p. The unobserved variables o, and u,(t) represent 
the unmeasured characteristics of consumer i; 9g; is a permanent 
component, and u,(t) is a time-varying error term with zero mean. If 
we assume that the real rate of interest, r(t) for ¢ = 1, is constant over 


"Since this study’s empirical objective is to examine the labor-supply behavior of 
prime-age males, this assumption is not unreasonable. 
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the life cycle and equal to yr, then the dX constant hours of work 
function reduces to 


where b = &(p — r).8 

The intercept term F; in this equation represents a time-invariant 
component that is unique to individual 7. This study treats F; as a fixed 
effect. Since F; contains In ); as one of its components, one cannot 
assume that F; is a “random factor” uncorrelated with exogenous 
variables of the model. Inspection of equation (8) reveals that A 
depends on the values of variables and constraints in all periods. By 
construction, F; is correlated with any exogenous variables used to 
predict a consumer’s wages or wealth. Hence, treating F; as part of the 
error term would result in biased parameter estimates of the labor- 
supply function. Treating F; as a fixed effect, on the other hand, 
avoids this bias. 

Estimating the parameters of equation (12) only requires variables 
observed within the sample period. Regressions of current hours of 
work on individual specific intercepts and current wage rates produce 
a full set of parameter estimates.® Because F; captures the effect of In 
hi, its estimated value summarizes all of the retrospective and pro- 
spective information relevant to consumer ?’s current choices. As there 
is no need to forecast any life-cycle variables that are outside the 


® There are other forms of the utility function that have convenient empirical 
specifications for the constant consumption and leisure demand functions. ‘Iwo such 
functions are 


U;(t) = K(A)LC(t) + wE(OI LA) + w(t)”, 
K;(t), @*, @ > 0, a@* + w < 1 or k,(t), w*, w < 0; 


[Cl +mROI” = 1 yy Ld + (DP = 1 
w* ‘ w ; 


KF (t), Ki(t) 2 0, w*, ac i 


U,(t) = KF(2) 


where x(t), Kt), uF(t), wt), o*, and w are all parameters. Both of these functions are 
concave. They include Cobb-Douglas, addilog, CES, and Stone-Geary as special cases. 
The d constant functions for consumption and leisure are log linear in \, wages, and the 
coefficients «#(t) and x,(t), which represent specific modifiers of tastes. This study uses 
the utility function given by (10) to formulate an empirical model because it implies a 
form for the labor-supply function which can be readily compared with labor-supply 
equations found in existing empirical work. 

®If utility at age ¢ depends on measured characteristics of the consumer that vary 
over the sample period, then current values of these “taste-shifter” variables would also 
be included as regressors. A natural way to introduce such taste-shifter variables is to 
model the taste coefficient Y,(¢) as a function of the form In Y,t) = 0; + X(t)B — uF(t), 
where Xt) is a vector of variables influencing tastes and £ is a parameter. For this case, 
X ,t)B6 enters as an additional linear term in the A constant labor-supply equation given 
by (12) or (13). 
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sample period, the A constant functions afford a considerable sim- 
plification of the empirical analysis. 

Use of the constant functions allows one to estimate parameters 
needed to characterize dynamic behavior without introducing any 
assumptions regarding a consumer’s behavior outside the sample 
period. To appreciate this point, consider the problem of predicting 
the additional hours of work a consumer will supply in response to 
observing a higher wage rate than he observed at a younger age. To 
obtain an estimate of this response using a traditional model of life- 
cycle labor supply, one must formally incorporate the worker’s future 
plans in the model. For example, if the worker anticipates an early 
retirement, future wages corresponding to the retirement years do 
not influence current labor supply. Thus, the researcher must not 
include these wages as explanatory variables. Such considerations lead 
to difficult data requirements and complicated estimation procedures. 
Using equation (12), on the other hand, it is possible to analyze this 
problem without knowing anything about a worker’s future plans; an 
individual constant term for each worker accounts for a worker’s 
future plans in a parametrically simple way. 


An Empirical Specification for Individual Effects 


Estimation of the \ constant labor-supply function given by (12) does 
not directly estimate all of the parameters required to characterize all 
aspects of labor supply. Differences among individuals in initial 
wealth or lifetime wage paths affect the level of hours of work 
through F;. To explain any aspect of labor supply other than dynamic 
behavior (e.g., how the hours of work of two individuals differ at a 
point in time), one must confront the problem of predicting indi- 
vidual effects. 

From the theoretical analysis above, we know that the value of F, or 
more properly X, is uniquely determined by the implicit equation 
given by (8). This equation does not admit an analytical solution for \ 
given the specific form of the utility function given by (10), even if it is 
known that this function applies to all ages and the consumer works in 
each period. The variable } is a complicated function of initial assets, 
lifetime wages, the interest rate, the rate of time preference, and 
parameters representing unobserved “taste” variables.!° Using such a 
relationship as an empirical specification is not feasible. 


* The equation determining }; is 


T. 
= l ty _|1w@y-1 
AiO) = > RO || ROU + pytn,|iter» 


1 


— W; Pel a oll 
(| Voi (d)@» 


RCL + pyaar fer}, 
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This study assumes that equation (8) implies a solution for \ in 
which In ) can be approximated as a linear function of measured 
characteristics, the natural log of wages at each age, initial wealth, and 
an unobserved random variable representing unmeasured character- 
istics. With this assumption the implied equation for F; is 


ae 
F,=Zo + > y(t) nW{(t) + 4,(0)8 + a;, (14) 
t=0 
where Z; is a vector of observed variables (e.g., family background 
variables), a; 1s an error term, and @, y(t), and @ are parameters 
assumed to be constant across consumers.!! This structural relation- 
ship for F; implicitly assumes that each consumer has a working life of 
T* + 1 years. According to the theoretical restrictions given by (9), the 
y(t)’s and 6 should all be negative. 

Unfortunately, to formulate an estimable version of an equation for 
F;, we require additional assumptions concerning the forms of the 
lifetime wage and income paths. In contrast to the ) constant labor- 
supply function, estimating the parameters of equation (14) requires 
data which normally are not available. Most variables appearing in 
this equation are not directly observed, including the dependent 
variable F;, wages outside the sample period, and initial wealth. While 
estimates of F; are obtained as a by-product from estimating equation 
(12), we still require a mechanism for predicting wages outside the 
sample period and initial permanent income. We do this by intro- 
ducing lifetime profiles for wages and income. 

This study assumes that the lifetime wage path is 


In Wt) = wa + tru + Pare, + Vit), (15) 


The assumption that these parameters are constant across consumers is, of course, 
only an approximation. Formally, it can be shown that 


Olnd d\n N(t) dInd 
ee ee eee 
WW) = oa WH dInd | 8A (0) 
where E*(t) = N(t)W*(t) and W*(t) = W(t)R(t) are the period 0 present value of 


earnings and wages in period t. This relationship for y(t) is derived by differentiating 
Roy’s identity for N(t) with respect to A(0) to obtain the equation 


E*(t), 














On ON(t) On = any Pare | OlnXd 


away aac) 7 G4) ana 9A(0) 


E*(t). 





— dr fdlnN(t) ein 
al alnXd 0A (0) 


Since the empirical specification for the \ constant hours of work function implies [0 In 
N(t)\/(@ In d) = 4, we see that we cannot formally have y(t) = 6 (0 In d)/Ld In W(t)] and 8 
= §(0 In A)/[AA(0)] constant. Notice that the effects of the interest rate and time 
preference on F; are absorbed into the coefficients of specification (14). 
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where qo 74, and 7»; are linear functions of the form 
ji Mig; J = 0, i a (16) 


M, is a vector of exogenous determinants of wages which are constant 
over the consumer’s lifetime (e.g., education and background vari- 
ables), g;,7 = 0, 1, 2, are vectors of parameters, and V(t) is an error 
term. This path assumes that wages follow a quadratic equation in age 
with an intercept and slope coefficients that depend on age-invariant 
characteristics of the consumer.’ 

Predicting a consumer’s initial wealth is complicated by the fact that 
most data sets do not contain extensive measures of even the con- 
sumer’s current wealth. Some measure of the consumer’s property or 
nonwage income during the sample period, however, is usually avail- 
able. Let Y(t) and A,t) denote the property income and assets of 
consumer ? at age ¢. If Y;,(t) is the income flow generated by investing 
assets A(t) at a rate of interest equal tor, we have the relationship Y {t) 
= A(t)r. Assume that the following quadratic equation in age ap- 
proximates the lifetime path for property income: 


Y(t) = Aoi ai tay; aa tao; IF v(t), (17) 
where Qj, Q@1;, and a; are linear functions of the form 
O58 di, foe O51, 2, (18) 


S;,1s a vector of measured age-invariant characteristics of consumer i 
(e.g., education and background variables), g;, 7 = 0, 1, 2, are 
parameter vectors, and p(t) is an error term.!4 The intercept a; can 
be thought of as a measure of consumer 2’s permanent income at age 
0; that is, ag = A,0)r.% 

Combining the lifetime paths for wages and income with equation 
(14) creates an equation for F; that can be estimated using data 


*In the following analysis it is assumed that this wage equation generates unbiased 
predictions for lifetime wages. These predictions need not be efficient nor do they need 
to be the same predictions used by consumers. It is possible to introduce many alterna- 
tive forms for the wage equation, such as higher-order polynomials or polynomials with 
other functions of time (e.g., reciprocals) replacing ¢ and #2, with only minor modifica- 
tions of empirical specifications used in the following analysis. 

‘8 These income measures seldom include imputed income generated by consumer 
durables, which is a major source of property income for most consumers. 

14 Tn contrast to W(t), Y(t) is determined endogenously in this model. Eq. (17) can be 
viewed as an approximation to the optimal lifetime path for Y(t) expressed as a function 
of the exogenous variables of the model. 

*Formally, this relationship between a; and A 0) is correct only in a continuous time 
framework. When modeling the problem in discrete time, one must distinguish assets 
held at the beginning, at the end of the period, and exactly when asset income is earned 
within the period. 
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observed within the sample period. Substituting the wage process 
given by (15) and the relationship ao; = rA (0) into equation (14) yields 


F,=Zgh + Too + TM1iy1 + Tety2 + aod + ni (19) 
or substituting relations (16) and (18) yields 
F,=Kab + a, (20) 


where 
me a 
I= DPD, j=01,2,  6= lr, (21) 
t=0 


K; is a vector including all age-invariant characteristics determining 
either wages, income, or J (1.e., all the elements of M;,S;, and Z;), wisa 
vector of coefficients, and »; is a disturbance term which is randomly 
distributed across workers with zero mean. Equation (19) is a struc- 
tural relationship between F and the characteristics of a consumer’s 
wage and income profiles. The empirical analysis of this study focuses 
on estimating the parameters of this equation, ¢, Yo, 71, Y2, and 6. As 
we shall see shortly, these parameters have a sound economic inter- 
pretation. Equation (20) is essentially a reduced-form equation for 
(19). By estimating the parameters of this equation, it is possible to 
predict how F varies across consumers using only age-invariant char- 
acteristics of the consumer as explanatory variables. 


III. Interpretation of Parameters 


In investigating the effect of changes in wages on labor supply, it is 
important to separate parametric change of the sort usually con- 
templated in comparative static exercises from evolutionary change 
due to movement along a life-cycle wage path.’® A parametric wage 
change refers to shifts in a life-cycle wage profile (e.g., a shift from 
path II to path I in fig. 1), while an evolutionary wage change refers to 
movements along a given profile (i.e., along any path in fig. 1). Thus, 
parametric wage changes refer to differences in wages across con- 
sumers, while evolutionary wage changes refer to differences in wages 
across time for the same consumer.” 

Consider the behavior of labor supply over the life cycle. As a 
consumer ages, he adjusts his hours of work in response to the 


16 This distinction goes back to Ghez and Becker (1975). 

17 This statement is true only in an environment of perfect certainty. If there is 
uncertainty about the future, a consumer can experience parametric wage changes as 
he acquires new information about his lifetime wage path. For a discussion of these 
issues see MaCurdy (1980, in press). 
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different wage rates he observes at each point in his lifetime. These 
labor-supply adjustments represent responses to evolutionary wage 
changes; they reflect the consumer’s desire to supply more hours in 
those periods with highest wages. There is no wealth effect associated 
with this kind of wage variation since the wage profile is known to the 
consumer at the beginning of his lifetime and changes in wages are 
due only to movement along this given profile. It is apparent from the 
labor-supply function given by (12) that the value of the parameter 6 
determines the hours of work response to evolutionary wage changes. 
Hereafter, I will refer to 6 as the intertemporal substitution elasticity. 
The theoretical prediction for its sign is positive. For the particular 
form of the utility function given by (10), 6 is also the direct elasticity 
of substitution for hours of work in any two periods. 

Now compare the labor-supply profiles of two consumers who face 
wage paths II and III, respectively. As illustrated in figure 1, the wage 
profiles for consumers II and III are the same except at age ¢t’ when 
consumer III’s wage rate is higher than consumer II’s. Let A denote 
the absolute value of this difference in period t’ wages. This wage 
difference represents a parametric wage change because it involves a 
shift in the lifetime path of wages. It causes the labor-supply profiles 
for consumers II and III to be different at all ages. Comparing these 
labor-supply profiles is the sort of problem usually considered in 
comparative static exercises. In terms of the empirical model outlined 
above, this higher wage rate has two effects on consumer III’s labor 
supply. The first effect is on the value of F. According to equation 
(14), consumer III will set a value for F which is lower than the value 
of F for consumer II by an amount equal to y(t") - A. This decline in F 
implies that at all ages other than t’ consumer III’s labor supply will be 
less than consumer II’s by a constant fraction. At age t’ there is a 
second effect of the wage difference. Neglecting the decline in F, 
consumer III’s labor supply at age t’ will be higher by an amount 
equal to 6 - A. Thus, the total impact on consumer III’s hours of work 
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at age t’ is [5 + y(t’)]-A. Since 56 > 0 and y(t’) < 0, there is no sign 
prediction for 6 + y(t’), so consumer III’s hours of work at age t’ may 
be greater than or less than consumer II’s. 

The parameters y(t’) and 6 + y(t’), then, determine the difference 
in consumer II’s and consumer III’s labor-supply profiles which is 
due to the discrepancy in their wage rates at age t’. The quantities 
y(t') and 6 + y(t’) correspond to the usual concepts of cross- and 
own-uncompensated substitution elasticities. These elasticities de- 
scribe the response of labor supply to parametric wage changes. They 
can be used to predict differences in labor supply across consumers. 
These elasticities do not directly provide information on the response 
of labor supply to evolutionary wage changes, so they cannot be used 
to predict differences in a given consumer’s labor supply over time. 
Since the intertemporal substitution elasticity exceeds the own- 
uncompensated substitution elasticities (i.e., 5 > 6 + yl[t’]), one ex- 
pects an evolutionary wage change to induce a larger labor-supply 
response than a comparable parametric wage change. The wealth 
effect associated with a parametric wage change accounts for the 
smaller labor-supply response. 

Comparing the labor-supply profiles for consumers I and II also 
involves a parametric wage change. As illustrated in figure 1, con- 
sumer II’s wage profile exceeds consumer I’s by a constant fraction 
over the entire life cycle. The parametric wage change associated with 
moving from consumer I to consumer I1’s wage profile is analogous to 
increasing the value of the intercept of the lifetime wage path, 77. 
This has two effects on labor supply in each period. First, a consumer 
adjusts his value of F in response to the profile shift. According to 
equation (19), F declines by an amount equal to Yo = ey y(t) times 
the increase in the value of 9. This decline in F implies a fall in hours 
of work at each age. Second, there is a direct impact on each period’s 
labor supply. Holding the value of F constant, a consumer increases 
his hours of work by an amount equal to 6 times the increase 1N 7. 
The implied total impact on each period’s labor supply, therefore, is 
Yo + 5 times the change in 7. Because Yo is unambiguously negative, 
there is no sign prediction for this total impact. Since yp + 6 is less 
than y(t’) + 6, however, the response of labor supply to a shift in 7 
should be less in algebraic value than the response to a shift in the 
wage profile only at age t’. The wealth effect associated with a shift in 
mis greater. The labor-supply profile for consumer II, then, can lie 
above or below consumer I’s labor-supply profile. It will lie above 
consumer I’s if Yo + 6 is positive. 

The empirical specification of life-cycle supply given by equations 
(12) and (19) provides a convenient framework for estimating the 
response of labor supply to the different kinds of wage changes 
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described above. Estimation of the \ constant labor-supply equation 
produces an estimate of the intertemporal substitution elasticity 6. 
This elasticity can be used to predict the response of labor supply to 
evolutionary wage changes; it provides the information one needs to 
describe a consumer’s dynamic behavior. Estimating the equation for 
F given by (19) produces estimates of the parameters yo, y1, y2, and 0. 
These estimates provide the additional information one requires to 
predict the response of labor supply to parametric wage and wealth 
changes, and they can be used to explain labor-supply differences 
across consumers. Combining the estimates of yo, y,, and y». with the 
estimate of 6 allows one to predict how labor-supply profiles adjust to 
changes in the wage-path coefficients ao, 771, and 7». This includes 
both shifts and slope changes of the wage profiles. The estimate of 6 
provides the information one needs to predict the response of labor- 
supply profiles to changes in a consumer’s initial permanent income. 
Estimating the empirical model proposed in this paper, then, fully 
characterizes a consumer’s lifetime labor-supply behavior. 


Three Substitution Elasticities 


Nowhere in the above interpretation of parameters was there any 
mention of compensated substitution elasticities. In terms of the 
above notation, it can be shown using Slutsky’s equation that the own- 
and the cross-compensated elasticities are 6 + y(t) — E(t)@ and y(t) — 
E(t)6, respectively, where E(t) =N(t)W(t) is real earnings at age ¢t.18 In 
the analysis of life-cycle behavior, it is important to distinguish sharply 
compensated elasticities from the intertemporal elasticity (i.e., 8) and 
the uncompensated elasticities (i.e., own effects 6 + y[t] and cross 
effects y[t]) discussed above. 

Some researchers incorrectly infer that intertemporal and compen- 
sated substitution elasticities are the same.!® The fact that an indi- 


'® According to Slutsky’s equation, 














AN(t)| _ aN(2) aN(t) 
aW(dlv  AW)lam  ’ aacoy 
So 
W(t) ON(t)| _ W(t) AN(t) dln N(t) _ 
N)OWlo NOW Olay © ONO Ga@ =F + 1 — EOS. 








* There has been some confusion concerning the interpretation of wage coefficients 
estimated in labor-supply studies such as those of Ghez and Becker (1975) and Smith 
(1977), who use synthetic cohort data. Because these studies estimate the response of 
hours of work to life-cycle wage growth, the wage coefficient is an intertemporal 
substitution elasticity. While this elasticity constitutes an upper bound for compensated 
and uncompensated elasticities, it is not one of the familiar elasticities associated with 
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vidual is at the same level of lifetime utility at all ages in a world of 
perfect certainty suggests that responses in hours of work to changes 
in the wage rate over the life cycle represent compensated substitution 
effects, which in turn suggests that the intertemporal and compen- 
sated elasticities are equivalent. These elasticities, however, are used 
to predict labor-supply responses to different kinds of wage changes. 
As discussed above, intertemporal elasticities determine hours of 
work responses to evolutionary wage changes. Compensated elas- 
ticities, on the other hand, are like uncompensated elasticities in the 
sense that they determine responses to parametric wage changes (i.e., 
wage changes due to shifts in wage profiles rather than movements 
along these profiles). Compensated elasticities, then, can be used to 
predict differences in hours of work across consumers whose wage 
profiles are different and whose lifetime utility is the same. 

Formally, the intertemporal substitution effect can also be inter- 
preted as an elasticity that is associated with a particular kind of 
parametric wage change. In particular, it determines the response of 
hours of work at age ¢ to a shift in the age ¢ wage rate holding d or the 
marginal utility of wealth constant. Thus, whereas uncompensated 
elasticities hold financial wealth constant and compensated elasticities 
hold lifetime utility constant, the intertemporal elasticity is equivalent 
to a parametric wage elasticity that holds \ constant. In the literature 
on consumer demand, this particular elasticity is known as the specific 
substitution effect.?° This correspondence between intertemporal and 
specific substitution effects is a direct consequence of the assumption 
that utility is additive over periods. 

While the three types of substitution elasticities are related, they are 
distinct and reduce to the same value only if income or wealth effects 
are zero. Assuming leisure is a normal good in all periods, these 
elasticities can be ordered as follows: 6 > 6 + y(t) — E(t)@ > 6 + y(t); 
that is, intertemporal responses are greater than compensated re- 
sponses which are in turn greater than uncompensated responses.”* It 


parametric wage changes. Thus, without imposing nontrivial restrictions on prefer- 
ences, wage coefficients estimated using synthetic cohort data cannot be used for policy 
analysis of the sort encountered in predicting labor-supply responses to negative 
income tax experiments or other proposed changes in tax policies. Elsewhere, Smith 
(1975) uses these estimated coefficients for exactly this purpose, namely, to predict the 
effect of income-maintenance programs on hours of work. 

20 See Phlips (1974, pp. 47-50) for further discussion. 

21 We can relate these elasticities using solutions of what is known as the fundamental 
matrix equation in the literature on consumer demand (see, e.g., ibid.). For leisure 
demand, it can be shown that: (1) the intertemporal effect is 


OL(t) | _ ) 2 
OW(t) |r ; 





where p22? is the (2, 2) element of the inverse of the hessian matrix of U[C(t), L(t)]; (2) 
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is also easy to show that intertemporal and compensated responses 
must be positive, while uncompensated responses may be either posi- 
tive or negative. Intertemporal and uncompensated substitution ef- 
fects, then, provide an upper and a lower bound for compensated 
substitution elasticities. 


IV. Empirical Analysis 


The A constant hours of work function given by (12) and the struc- 
tural equation for F given by (19) provide a manageable empirical 
model for analyzing labor-supply behavior in a life-cycle setting. This 
model naturally suggests a two-step estimation procedure. In the first 
step, one estimates the parameters of the » constant labor-supply 
equation. This step provides an estimate of the intertemporal sub- 
stitution elasticity and all the information a researcher requires to 
predict how the hours of work of a given consumer will differ at two 
points in time. In the second step, one uses estimated “individual 
effects” as dependent variables to estimate the structural equation for 
F which produces estimates of wealth effects and uncompensated 
substitution elasticities associated with shifts in the intercept and the 
slope of the lifetime wage path. This step provides the additional 
information one requires to predict how labor supply will differ 
across consumers. Using results from both steps, it is possible to 
compute the average own- and cross-uncompensated substitution 
elasticities associated with wage changes in a single year. These results 
also permit the calculation of upper and lower bounds for compen- 
sated elasticities. This two-step estimation procedure exploits the spe- 
cial characteristics of panel data to characterize life-cycle behavior 
with a minimal amount of computational burden.”2 


the compensated effect is 
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So, if L(t) is a normal good and U is concave, we have 


aL (t) dL (t) aL(t) 
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f 
The proposed restrictions on labor-supply elasticities follow immediately using this 
result. 

” There is an alternative strategy for estimating this structural model of lifetime 


labor supply. Instead of the procedure suggested above, one could use a one-step 
procedure. Substituting the right-hand side of eq. (19) for F directly into the A constant 
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The empirical work reported here uses the randomly designed 
sample from the Michigan Panel Study of Income Dynamics. The 
sample consists of observations of 513 prime-age, white, married 
males for the years 1967—76. Only males continuously married to the 
same spouse during the period 1968-77 and who were 25-46 years 
old in 1967 were included in the sample.”* The labor-supply variable 
used in the empirical analysis is annual hours of work. The wage 
variable is average hourly earnings deflated by the Consumer Price 
Index. 

To avoid confusion in this section we must be careful to distinguish 
between age variables and indexes representing a sample period. As 
in the previous discussion, ¢ denotes the age of a consumer; in par- 
ticular, t = 0 when the consumer is 25 years old, t = 1 when he is 26, 
etc. The index, on the other hand, denotes the sample period; 7 = 1 
when the observation is from year 1967, 7 = 2 when it is from 1968, 
etc. The following analysis assumes that there are a total of rt sample 
periods (7 = 10 for the sample used in this paper). Finally, the 
notation t(j) denotes the consumer’s age in sample period j. Thus, In 
W(t) is consumer 2’s wage at age ¢t, In W;(/) is his wage in period 7, and 
In W,(t) = In W{t(j7)] = In Wy). 


Estimates of the Intertemporal Substitution Elasticities 


Estimating the parameters of the d constant labor-supply equation is 
simplified by working with a first-differenced version of this equation. 


labor-supply equation given by either (12) or (13) produces a new equation containing 
all of the structural parameters of interest. It is possible to estimate this equation in a 
single step using constrained simultaneous-equation estimation procedures. The two- 
step procedure offers advantages over this one-step method when there exists mis- 
specification of the equation for F. The presence of any misspecification errors in the 
F-equation leads to inconsistent estimators for all parameters of the labor-supply 
equation in the case of the one-step procedure. For the two-step estimation scheme, 
however, only the second-step estimators for the parameters of the F-equation are 
inconsistent. 

Houthakker and Taylor (1970, chap. 5) and Phlips (1974, pp. 190-93, 250-60) have 
estimated marginal utility of wealth constant demand functions as an intermediate 
computational step toward estimating a system of ordinary demand functions. In 
contrast to their work, I directly estimate the A constant demand functions as a means of 
characterizing a consumer’s dynamic behavior. I treat ) as a fixed effect which sum- 
marizes the effect of historic and future information on current decisions. I thank 
James Heckman and Orley Ashenfelter for the Houthakker-Taylor reference. 

23 A worker had to satisfy the following criteria as well to be included in the sample: 
(1) He must be classified as employed or unemployed (i.e., permanently disabled and 
retired were deleted). (2) Wage and labor-supply data must be available for all years. (3) 
A worker must report less than 4,680 hours worked per year. The absolute value of the 
difference in his real average hourly earnings in adjacent years cannot exceed $16 or a 
change of 200 percent. The absolute value of the difference in the number of hours he 
works in adjacent years cannot exceed 3,000 hours or a change of 190 percent. The 
purpose of this last criterion is to minimize difficulties arising from the presence of 
outliers. : 
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First differencing equation (12) yields 
Din Nig) = olp —7())| + OD ly Wah) ef) fe 2c oo) 


where D is the difference operator, that is, D In N{7) = In Nj) — In 
Nj — 1), and ej) = Du((j) is a disturbance. This equation relates a 
consumer’s labor-supply changes to changes in his wage rate. Notice 
that differencing eliminates individual effects, and thus it avoids the 
introduction of incidental parameters. Step one of the estimation 
procedure is to use this equation to estimate the intertemporal sub- 
stitution elasticity, 6. 

Combining these equations for a given worker into a single system 
of simultaneous equations creates a model that is particularly well 
suited for an empirical analysis. Stacking equation (22) for worker 2 
according to the j index yields 


D \n N,(7) B, D \In W(7) €(7) 
D In Nr — 1) Oa D \In Wir — 1) e(7 — 1) 
= |. + : OM : > (23) 

D In N((3) Bs D In N((3) €,(3) 
D In N((2) Bo D In N,(2) €,(2) 
where 8; = dlp — r(y)],j = 2,..., 7. Putting this system into vector 
notation, we have 

DinN;=B+DInW6§ + «€, ne Ste (24) 
where D In N,, B, D In N;, and e, are (rt — 1) X 1 vectors with D InN; = 
[DiniN GG), 2..5D InN), Bo = (B,,<- 2, Bs) 3D In W7 = WD In Wa), 
...,D InW((2)], & = [e(r), .. . , €(2)], and n is the total number of 


workers in the sample. The following analysis assumes that the error 
vectors €; are independently distributed across individuals once com- 
mon time effects are removed with the inclusion of year dummies in 
the labor-supply equations. No restrictions are imposed on the 
covariance matrix of €;, which permits arbitrary forms of serial corre- 
lation. 

The parameters of equation (24) are estimated using standard 
two-stage and three-stage least-squares procedures which permit the 
imposition of equality constraints across equations.* The wage- 
growth variables, D In Wj), 7 = 2,..., 7, are treated as endogenous 
variables. The set of instruments used to predict D In W,()) includes 


** The two-stage procedures must account for the fact that the covariance matrix of 


the ¢;'s is not proportional to the identity matrix when computing standard errors for 
the estimates. 
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family background variables,2* education, age, interactions between 
education and age, and dummy variables for each year of the sam- 
ple. Estimation of (24) using simultaneous equation methods takes 
advantage of the time-series aspect of panel data to estimate the 
intertemporal substitution elasticity 6 with a minimal amount of com- 
putational burden. These methods avoid biases arising from pure 
reporting error in earnings and hours of work, and they offer a 
flexible framework for testing and estimating alternative functional 
forms. 

Another equation that can be used to estimate 8 is one that relates 
changes in hours of work to changes in earnings. Adding § - D In Nw) 
to both sides of (24) and solving this new equation for D In NJ) yields 








Din qj) = 6" 8p ney) + 
j= 


eee kT (25) 


where Ej) = N«j)W(j) is real earnings in period j. Stacking these 
equations creates a model like (24), except that D In W,, 6, and e; are 
replaced by D In E,, 6/(1 + 8), and [1/(1 + )]e;, respectively, and the 
elements of B become 8; = {8[p — r(y)]}/(1 + 8),7 = 2,...,7. Exactly 
the same procedures described above for estimating equations (24) 
are applied to estimate the parameters of the stacked representation 
of (25). Using the coefficient on earnings, 8/(1 + 8), it is possible to 
construct an estimate of 6. 

Table 1 presents estimates of the intertemporal substitution elastic- 
ity. Two specifications of the labor-supply equation are considered. 
One assumes that the interest rate, r(t), is constant over time, and it 
constrains intercepts in the labor-supply equations (i.e., the elements 
of B) to be equal over the sample. The other allows r(t) to be different 
in each period by including dummy variables for each year without 
any constraints on their coefficients, which permits intercepts to be 
different each period. These alternative empirical specifications yield 
similar results. All of the implied estimates of the intertemporal sub- 
stitution elasticity are positive. According to the estimates of the wage 
coefficients, 6 lies in the range .10—.23. The earnings coefficients 
indicate a range of .25-.45 for 6. The earnings coefficients indicate a 


25 Family background variables include the education (in years) of both the father 
and the mother of the consumer and dummy variables indicating parents’ economic 
status at the time the consumer was growing up. Both education and education squared 
and interactions between these two education variables and age are included as instru- 
ments. Coefficients in the “prediction equation” for wage growth are constrained to be 
equal across time periods. Formal hypothesis tests accept this restriction. In the appli- 
cation of constrained three-stage least squares, the wage-growth equations are not 
treated as part of the simultaneous-equation system. 
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TABLE 1 


SIMULTANEOUS-EQUATION ESTIMATION OF FIRST-DIFFERENCED LABOR-SUPPLY-EQUATION 
ESTIMATES OF THE INTERTEMPORAL SUBSTITUTION ELASTICITY 











Estimation Average of 
Procedure D (Log Wage) D (Log Earning)* Intercept Year Dummies 
2SLS 223 e —.009 
(2.42) (4.02) 
BSS 14 eae —.008 
(1.97) (4.26) 
25185 ares .35 —.006 
(2222) (4.26) 
3SLS ae 25 —.006 
(2.63) (5.18) 
2SLS 15 ade no —.008 
(.98) 
3SLS .10 ae sae —.008 
(.80) 
25169 on 45 see —.007 
(1.54) 
3SLS Sy. .30 my: —.007 
(1.67) 





Note.—Absolute values of t-statistics are in parentheses. 

* The estimates and ¢-statistics reported in the “Log Earning” column are for 6; they are computed using the 
coefficient on earnings, denoted 4, and its ¢-statistics. We have 6 = /(1 — #). To convert the t-statistics reported for yf 
to those for 6 requires division by the quantity (d6/dy) = (1 + 8)? evaluated at 6 = 6. 


higher estimate for 6 in all cases, but these differences are small 
relative to their standard errors. The estimates of the intercepts 
indicate that the real rate of interest exceeds the rate of time prefer- 
ence on average by about 2—4 percentage points. 

A comparison of these results with others in the literature is 
difficult since most studies use cross-section data for their empirical 
analysis where differences in lifetime wage paths are the primary 
source of wage variation across observations. As a consequence, they 
do not estimate the intertemporal substitution elasticity. Estimating 
equation (13) using cross-section data is complicated by the presence 
of individual effects, F;. As discussed above, economic theory implies 
that F; is correlated with a consumer’s wages and all of his other 
characteristics. Therefore, it is not reasonable to assume that F; is a 
“random factor” uncorrelated with explanatory variables. One, then, 
cannot directly use observations on individuals from a cross section to 
estimate the parameters of (13) even if one uses simultaneous equa- 
tion estimation procedures. Such procedures implicitly treat indi- 
vidual effects as random variables, and this leads to inconsistent 
parameter estimates. To estimate the intertemporal elasticity using 
cross-section data, one requires a specification of the labor-supply 
equation where variation in wages reflects evolutionary wage change. 
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One approach constructs synthetic cohorts to estimate equation (13) 
using cross-section data as implemented by Ghez and Becker (1975) 
and Smith (1977). A synthetic cohort is constructed by computing 
geometric means of wage rates and hours worked for each age group, 
and it is assumed to represent the life cycle of a typical individual. The 
basic assumption underlying this approach is that there are no cohort 
effects, so that group individual effects (i.e., the average of the Fjs 
for each age group) are the same for all age groups after adjusting 
for “smooth” vintage effects. In this case, least-squares estimation 
of equation (13) using synthetic cohort data produces consistent 
parameter estimates.”* This approach allows for measured “taste- 
shifter” variables such as family size and age. In principle, this ap- 
proach accounts for the endogeneity of wages and measured charac- 
teristics by using group averages as instruments. The problem of 
treating group effects as random variables does not arise since it is 
assumed that they are the same for all age groups. 

The estimates of the intertemporal substitution elasticity obtained 
by Ghez and Becker and Smith are comparable to those estimates 
presented in table 1 of this paper. Becker forms synthetic cohorts 
using the 1960 U.S. Census, and he obtains estimates for 6 ranging 
from —.068 to .44.?7 Smith, on the other hand, treats the family as the 
relevant decision unit and uses the 1967 Survey of Economic Oppor- 
tunity to form his synthetic cohorts. He estimates 5 to be about .32.28 
Comparing these estimates of 6 and those reported in table | suggests 
that cohort effects do not seriously bias estimates based on synthetic 
cohort data. 


Estimates of Responses to Parametric Wage Changes 


Estimating the structural equation for F given by (19) provides the 
additional information we require to predict a consumer’s labor- 
supply response to parametric wage changes. Estimating this equation 
is not as difficult as it may first appear. It is true that all of the 
variables appearing in this equation (Fj, 70; 771i, 772i, and a) are not 
directly observable. But it is possible to construct observable quantities 
that have expected values equal to these variables. If one replaces the 
unobserved variables by their observed counterparts, one can employ 
standard two-stage least-squares procedures to estimate the structural 
parameters of interest. 


26 Ghez and Becker and Smith do not interpret their parameter estimates as those of 
aX constant labor-supply function. Given their log-linear specifications, however, In ) is 
absorbed into the intercept of their regression specifications. 

27 Estimates obtained from Ghez and Becker (1975, pp. 112, 114). 

8 Estimates obtained from Smith (1977, p. 244). 
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First consider the coefficients of the lifetime path for wages. Define 
the difference operator D, as D; In W,(j) = In Wig) — InWig — k). 
Applying this operator to the wage equation given by (15) and divid- 
ing the result by & yields 


D, In Wi) 


k = 74 + ml2t) —k]) + ee 


Subtracting D, In W{(2) from this equation and dividing the result by 
1/(27 — k — 3) creates a new equation: 


poet Titi Dil Gall, Sey cay Soe 
2) stp=aeo k 
DV) —_ DWV{2) 
(2) Kea Ro] ie 
Notice that the dependent variable of this last equation has mean 
equal to 7; Replacing 72; in the previous equation by this dependent 
variable allows one to create another observable variable whose mean 
is 771; Further substituting these two observable variables for 72; and 
7; in the original wage equation allows one to create a third measur- 
able variable whose expected value is 7;. Following this strategy and 
taking averages to use all the available data, consider the following 
definitions: 
Tiana : 
faite gern ee ae 


2 Here | 
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— 


7 


3 1 pa lnWiGal an 


Ty = 
J 


[2t(j + 1) - | (27) 


= 


i 


RT 


6 = => {In Wij) = tl) — walt DPY, (28) 


= 


= 


where 7 is the total number of sample periods. It can be shown that 
E(ani) = Tri, h = 0, 1, 2. This is an important result because the Tris 
are observable variables, and it is possible to use the 77;;’s as dependent 
variables in a simultaneous-equation analysis to estimate the 7p;’s 
consistently. 

Similarly, given observations on consumer 7’s income over the sam- 
ple period, one can construct the variable ao; using definition (28) with 
Yj) replacing In W,(j); do; then, can be used in a simultaneous- 
equation analysis to predict the intercept of the lifetime income path, 
ao, Which is a measure of consumer 7’s initial permanent income. 

An analogous strategy can be used to construct a measurable vari- 
able to serve as a proxy for F;. From equation (13) we see that an 
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average of the quantities In N,(j) — bt(j) — & In Wj) has an expected 
value equal to F;, but unfortunately, this average cannot be directly 
observed since it depends on unknown parameters b and 6. We have 
estimates of these parameters, however, from the first step of the 
empirical analysis. A logical alternative to the above average, then, is 
to form the variable 


agin lex eer 
ates > On N,(j) — bj) — 8 In W,(7)). 
j=l 
Asymptotically, F; has an expectation equal to F;. 
Collecting the above results, we have a complete simultaneous- 
equations model given by 


Te Mign + Nhs h = 0, 1, 2, (29) 
ay = Sido + Ns, (30) 
F, = b + Too + Tuy + Foye + Aoi + 14, (31) 


where the vectors of exogenous variables M; and S; and the coefficient 
vectors £0, 21, Z2, and qo are defined by (16) and (18), and the 7,’s are 
disturbances. We have one set of equations for each consumer 7. The 
endogenous variables in this model are F;, @;, and the 7,;’s; the 
exogenous variables are the elements of M; and S;; and the structural 
parameters of interest are Yo, ¥:, Y2, and 6. The vectors M, and S;, 
contain variables determining the coefficients of the lifetime wage 
and income paths. In the following empirical analysis, they include 
the consumer’s education, his education squared, and family back- 
ground variables.?9 

To estimate the parameters of the structural equation for F given 
by (31) consistently, one can employ a standard two-stage least- 
squares procedure. The standard errors reported by this procedure 
are valid if the number of time-series observations for each consumer 
is sufficiently large.%° 

Table 2 presents estimates for the structural parameters of the 
individual effects equation given by (31), where F; is computed using 
three different sets of estimates for ) and 6. The estimates of Yo, 71, 


79 Family background variables include the education of both the father and the 
mother of the consumer and dummy variables indicating parents’ economic status at 
the time the consumer was growing up. 

°° If one does not have a sufficiently large number of these time-series observations, 
however, the usual standard errors are invalid. The problem lies in the fact that we 
use estimated values for b and 6 to form the dependent variable F;. In cases where the 
number of time-series observations is small, one must adjust the usual standard errors 
to account for errors in estimating b and 6. The precise form of this adjustment can be 
obtained from the author upon request. While this adjustment is not complicated, it 
does require the use of matrix operations. This adjustment was very minor in every 
instance for which it was used in this study, which suggests that even 10 observations 
per person is large enough to neglect making any adjustments in standard errors. 
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TABLE 2 


SIMULTANEOUS-EQUATION ESTIMATION OF FIXED-EFFECTS EQUATIONS 











b, § Yo V1 Y2 0 Intercept 
at OS Se Oo) —10.47 —.00026 7.81 
(eS) (1.4) lz) (.48) (157) 
—.009, .15 = 07 —1.08 SH) —.0001 Heda 
(1.78) (1.65) (1.31) (.16) (137) 
O00 23 Sarit) —1.46 — 16.86 .0001 7.68 
(2213) (1.9) (1.4) (.2) (119) 





Nore.—Absolute values of ¢-statistics are in parentheses. The dependent variable F, (i.e., the proxy variable for F;) 
is computed using the estimates b and 6 reported in the stub column. Income is measured in thousands of dollars. 


and y» are all negative as theory predicts. These estimates decrease 
monotonically as one uses a higher estimated value of 6 to compute 
the proxy variable for individual effects, F;. Since uncompensated 
substitution elasticities associated with permanent wage changes are 
calculated by adding the estimate of 6 to the estimate of yo, an inverse 
relationship between the estimates of 6 and ¥» is required if uncom- 
pensated permanent wage elasticities are to remain constant for dif- 
ferent choices of 6 in computing F;. 

All of the estimates of the initial permanent income coefficient, 9, 
are statistically insignificant and very small. The measure of property 
income (i.e., Y;[¢]) used in this empirical analysis is total family income 
minus husband’s earnings in thousands of 1967 dollars. While the 
estimates of 6 are negative as theory predicts when low estimated 
values of 6 are used to construct F;, we see that the estimated effects of 
a change in income on hours of work are minute; a $1,000 increase in 
initial permanent income leads to at most a .026 percent decrease in 
hours of work. Several other measures of property income were tried 
in the empirical analysis;3! in every case the estimates obtained for 6 
were of the order of magnitude reported in table 2. 

Combining estimates of y) and 6 allows one to form estimates of 
cross- and own-uncompensated substitution elasticities and bounds 
for compensated elasticities associated with wage changes in a single 
year. Dividing the estimate of y) by the length of the working life 
produces an estimate of the average cross-uncompensated elasticity. 
Using results from the second row of table 2 and assuming a working 
life of 40 years implies a cross elasticity equal to —.0018. Adding the 
estimate for 6 (which is .15 for the second row of table 2) to this cross 
elasticity implies a value of .15 for the average own-uncompensated 


*‘ One measure of income excluded wife’s earnings. Others included imputed in- 
come for house ownership. Due to data limitations, it was not possible to include im- 
puted income from other forms of consumer durables, which certainly constitute a 
major component of a consumer’s nonwage income. 
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elasticity. Since the average own-compensated elasticity lies between 
the intertemporal and the average Own-uncompensated elasticity, we 
conclude that this compensated elasticity is also approximately equal 
to .15. Increasing a consumer’s wage rate in period ¢ by 10 percent, 
then, leads to about a 1.5 percent increase in his hours of work in 
period ¢ and approximately no change in his hours of work at other 
ages. 

Combining estimates of yo, y;, and y,. with estimates of § provides 
the information needed to predict a consumer’s labor-supply re- 
sponse to shifts in his wage profile. In response to a uniform 10 
percent increase in wages at all ages (i.e., a parallel shift in the log 
wage profile), the estimates of the second row of table 2 predict that a 
consumer will adjust his hours of work by an amount equal to (7) + 
§)10% = (—.07 + -15)10% = .8 percent at all ages. There is, then, a 
small positive response in a consumer’s labor supply to parallel shifts 
in his log wage profile. If the slope of a consumer’s wage profile is 
altered by changing the coefficient on the linear term (i.e., 77,;) by A 
percent, the estimates for y, and 6 indicate that his hours of work at age 
t change by (y,; + 6t)A% = (—1.08 + .15t)A%. Recall that t here 
measures a consumer’s age and takes a value of 0 when the consumer 
is 25 years old. Hence, hours of work decline at early ages (i.e., prior 
to age 32) and they increase at later ages in response to this sort of 
increase in the slope of the wage profile. The same is true when the 
slope of the wage profile is altered by changing the coefficient on the 
quadratic term (i.e., 77;) by A percent. Hours of work at age ¢ adjust by 
an amount equal to (—13.03 + .15t?)A%, which also implies a decline 
in hours of work at younger ages (i.e., prior to age 34) and an increase 
at older ages. 

The estimates above of uncompensated elasticities are generally 
consistent with results found in cross-section studies and the popular 
notion that the lifetime labor-supply curve of prime-age males is not 
very responsive to permanent wage changes. If, in a cross-section 
analysis, one purges wages and income of their transitory components 
using a simultaneous-equation method and interprets the estimated 
coefficients as some sort of “lifetime average” relationship, then one 
finds small positive estimates for wage coefficients and negative or 
zero estimates for income coefficients for prime-age males,*? which 
agrees with the results above. In contrast to previous work, however, 
the estimates above also indicate that to predict the response of labor 


32 Accounting for the endogeneity of In W;(t) and Y;(t) has been shown to have a 
significant effect on cross-section estimates. DaVanzo, Detray, and Greenberg (1976) 
find that treating both wages and income as endogenous variables leads to positive 
estimates for wage coefficients and negative or zero estimates for income coefficients 
for prime-age men, which are consistent with the empirical results reported above. 
Neglecting this endogeneity produces estimated coefficients with opposite signs. 
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supply to shifts in the lifetime wage path one must be careful to 
specify the particular shift involved and the time of the life cycle 
relevant for evaluating the response. The lifetime labor-supply curve 
for prime-age males is backward bending for some types of wage 
changes over part of the life cycle, and it is positively sloped for other 
age ranges and wage changes. 


V. Conclusion 


This study formulates a manageable empirical model of labor supply 
that fully incorporates life-cycle considerations. This model naturally 
divides the analysis into two steps. In the first step, the analysis 
concentrates on measuring parameters relevant for describing a con- 
sumer’s dynamic behavior. Here one estimates the response of hours 
of work to evolutionary wage changes. In step two, the analysis fo- 
cuses on measuring parameters relevant for explaining differences in 
labor supply across consumers. This step produces estimates of the 
impact of parametric changes in wealth and in wages on hours of 
work over the life cycle. This two-step analysis offers a very tractable 
estimation procedure with minimal data requirements. 

We have seen that there are three types of substitution elasticities 
relevant for predicting the response of hours of work to changes in 
the wage rate. The existence of these three elasticities reflects the fact 
that the effect of a wage change depends on its source. The intertem- 
poral elasticity determines the labor-supply response to wage changes 
resulting from life-cycle wage growth and movements over a perfectly 
foreseen business cycle. Uncompensated and compensated elasticities, 
on the other hand, determine the hours of work response to shifts in 
wage profiles. When specifying these latter elasticities one must iden- 
tify not only the particular wage profile shift involved but also that 
part of the profile that is being held constant. While the three types of 
substitution elasticities are distinct, they are related with intertem- 
poral and uncompensated elasticities providing an upper and lower 
bound for compensated elasticities. 

This paper presents a full set of estimates required to describe the 
lifetime labor-supply behavior of prime-age males. Estimates of the 
intertemporal substitution elasticity indicate that a 10 percent increase 
in the real wage rate which is due to life-cycle wage growth induces a 
1—5 percent increase in hours worked. The estimates of own-period 
uncompensated and compensated substitution elasticities range be- 
tween .1 and .5, and cross-uncompensated elasticities associated with a 
1-year wage change are approximately zero. Estimates of uncompen- 
sated elasticities associated with shifts in the entire wage profile indi- 
cate that a uniform 10 percent increase in wages at all ages leads to a 
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0.5—-1.3 percent increase in hours of work, and an increase in the 
slope of the profile leads to a decline in hours of work at early ages 
and an increase at later ages. The interpretation of these empirical 
results is, of course, dependent upon strong theoretical assumptions. 
The analysis in this paper neglects the presence of taxes, and it 
assumes that capital markets are perfect. There is an implicit assump- 
tion that hours of work are perfectly flexible and chosen freely by the 
worker. This paper also ignores the role of human capital investment 
in measuring both the supply of labor and the returns to work. 
The important point for an analyst to extract from this study is the 
following: Recognizing that individuals make their decisions in a 
life-cycle setting is crucial if one’s objective is to estimate economically 
meaningful parameters. Creating an empirical model that accounts 
for such a setting need not complicate the analysis, and it generally 
leads to a more complete understanding of consumer behavior. 
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Output Effects of Government Purchases 
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The theoretical analysis focuses on the distinction between tempo- 
rary and permanent movements in government purchases. Under 
plausible conditions, the temporary case involves an output response 
that is positive, less than one-to-one with the change in government 
purchases, and larger than that generated by an equal-sized, but 
permanent, shift in purchases. The equilibrium real rate of return 
rises in the temporary case, but changes little in the permanent one. 
Defense purchases are divided empirically into “permanent” and 
“temporary” components by considering the role of (temporary) 
wars. No temporary shifts in nondefense purchases were isolated. 
Empirical results verify an expansionary output effect for temporary 
purchases that exceeds that of permanent purchases. The results for 
some other expectational hypotheses are found to be generally sup- 
portive of the theory. 


Macroeconomic analysis typically assigns government purchases an 
important role in influencing aggregate demand and thereby in af- 
fecting output and employment. Bailey (1971) points out that these 
expansionary effects are offset to the extent that governmentally 
provided goods and services are close substitutes for private con- 
sumption expenditures. Hall (1980) argues that temporary changes in 
government purchases can have a substantial business cycle role, 
because they stimulate intertemporal substitution of work and pro- 
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duction. These effects are most important in the case of transitory 
expenditures that are not close substitutes for private spending— 
notably for wartime spending—but would not apply to long-run 
changes in government purchases. Public services also play a role as 
an input (perhaps usually with positive marginal product) into private 
production processes. This characteristic provides a direct channel 
whereby shifts in government purchases can alter the level of total 
output. 

The present analysis focuses on the theoretical and empirical dis- 
tinction between temporary and permanent variations in government 
purchases. A simple theoretical framework is used to analyze the 
output and real-rate-of-return effects of these purchases. It is argued 
first that movements in the real rate of interest arise mainly when 
government purchases are temporarily high or low and, second, that 
the response of output is likely to be larger when the change in 
purchases is temporary rather than permanent. 

The empirical section estimates the division of defense purchases 
into permanent and temporary components by considering the ef- 
fects of war and of war expectations. Defense spending associated 
with wars is largely transitory, while other changes in defense spend- 
ing turn out to be predominantly permanent. Shifts in nondefense 
federal plus state and local purchases are also mostly permanent in 
character. 

Analysis of real GNP reveals a significant expansionary effect of 
temporary defense purchases. Permanent defense purchases have a 
significantly weaker, but still significantly positive, effect on real GNP. 
The coefficient associated with permanent nondefense purchases is 
imprecisely determined, which prevents concluding either that these 
output effects are nonzero or that they differ significantly from those 
produced by permanent defense spending. Because no temporary 
changes in nondefense purchases were isolated, it was not possible to 
determine the output effects from this category of purchases. Some 
more detailed hypotheses that concern the generation of expected 
long-run average defense purchases are formulated, tested, and ac- 
cepted. Finally, the determination of real GNP during World War II 
is analyzed and compared with relationships for the postwar period. 


I. Theoretical Considerations 
Setup of the Model 


This section constructs a simple theoretical framework, which is used 
to study the effects of government purchases on output and the real 
rate of interest. The setting is designed to focus on the distinction 
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between temporary and permanent movements in government pur- 
chases. 

Suppose that the economy-wide credit market establishes an antici- 
pated real rate of interest on loans, which is denoted by r. For simplic- 
ity, economic agents are assumed to act as if r were constant over time. 
The model can be extended to permit (valid) expectations in some 
circumstances of divergences between current and anticipated future 
values of r (see below). 

An increase in r motivates the postponement of consumption and 
leisure from the present to the future. Therefore, r has a negative 
effect on consumption demand, C4, and a positive effect on the supply 
of labor services.1 The positive effect on labor supply implies— 
through the equilibration of a labor market that is not considered 
explicitly—a positive effect of ron commodity supply, Y*. For present 
purposes, it is satisfactory to ignore the intervening factor markets 
and view households and producers as integrated economic units. In 
this case, the rise in r directly boosts current supplies of goods and 
services. A higher value of r tends also to deter the accumulation of 
capital. However, in order to simplify matters, the discussion of in- 
vestment effects is limited to footnotes in the subsequent analysis. 


The Government's Budget 


The government's real demand for commodities during period ¢ is 
denoted by G,. Let 7; represent the real value of date t’s tax collec- 
tions, net of any transfer payments. Taxes and transfers are treated 
initially as lump sum in nature, but this assumption is relaxed later. 
Inflationary finance can be viewed as a particular form of (non-lump- 
sum) tax, which need not be introduced separately for present pur- 
poses. The model does not deal with monetary variables or the de- 
termination of the absolute price level and the nominal interest rate. 
These matters do not seem central for a study of the real effects of 
government purchases. 

Abstracting initially from interest-bearing public debt, the govern- 
ment’s budget constraint requires an equality each period between 
purchases and the net amount of real taxes: 


Cpa Ts (1) 


The possibility of government borrowing relaxes this condition of 
budget balance each period, but does not alter the principal findings 
that are discussed below. 


‘A change in r involves pure, unambiguous substitution effects if the underlying 
production technology is held fixed (see Bailey 1971, chap. 6). 


GOVERNMENT PURCHASES 1089 


In calculating its permanent income, the representative household 
figures in the anticipated present value of its share of taxes-net-of- 
transfers. For aggregate purposes, the important magnitude at some 
starting date 0 is the expectation of the present value, 2%, [T;/(1 + r)‘]. 
From equation (1), this magnitude coincides with the expected pres- 
ent value of government purchases, 22, [G,/(1 + r)‘].2 It is conve- 
nient to work with the uniform flow of purchases, G, that would yield 
the same present value of purchases as the time path, G,. This flow, 
which is referred to as “permanent purchases,” is determined from 
the condition 


G=r a [G/Q +r)‘. (2) 
f=1 


Holding fixed any service value that the private sector attaches to 
the tme path of G-—which is discussed next—a rise in G impacts on 
households exactly as would a corresponding decrease in permanent 
income. In particular, an increase in permanent purchases, G, tends 
to reduce C4 and raise the supply of labor services at all dates. (The 
positive response of work effort depends on the lump-sum nature of 
taxes. See the subsequent discussion.) The increase in work offers 
translates—through the equilibration of the labor market or via the 
direct behavior of household/producers in the present frame- 
work—into increased commodity supply, Y*. 

For a given value of 7, a rise in G requires a one-to-one decline in 
the representative household/producer’s “average” planned value 
over time for C4 net of Y’. This result follows from the intertemporal 
budget constraint for a household/producer, where the time path of 
C4 and net real taxes appear on the expenditure side, and the time 
path of Y* appears on the income side. If the current value of C4 net of 
Y® falls by less than one-to-one with G, then the typical household 
must be planning to reduce some future net values by greater than 
one-to-one. That is, the household responds in this case to the drop in 
effective permanent income by shifting relative expenditures (on 
consumption and leisure) from the future to the present. Similarly, a 
decline in current values by more than one-to-one with G would 
signify an intention to shift expenditures from the present to the 
future. Since.a pure income effect is involved, it seems reasonable to 


2 With public debt included, the expected present value of net real taxes equals the 
expected present value of real purchases plus the initial amount of real government 
debt. In particular, there is still a one-to-one relation between changes in the antici- 
pated present values of real purchases and net real taxes. This calculation assumes that 
the government's real interest rate equals r. The possibility of chain-letter, perpetual 
deficit finance has also been excluded. See Barro (1978) for a discussion of these and 
related matters, including the role of finite lifetimes. 
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concentrate on the intermediate case where the intertemporal pattern 
of expenditures is unchanged. In this case, at a given value of 7, the 
decline in current C4? net of the increase in Y* exactly matches the rise 
in G.3 


The Role of Public Services 


The government is viewed as utilizing its commodity purchases at 
each date, G,, to provide a contemporaneous flow of public services to 
the private sector. These services are treated as provided free of 
charge to household/producers. Two types of services are considered. 
One form is modeled as a direct conveyer of utility to households. 
Examples which do not encompass the traditional roles of govern- 
ment include parks, libraries, school lunch programs, (subsidized) 
hospitals, and, possibly, highway and transportation programs. (The 
latter category could be viewed alternatively as an input to private 
production functions.) An important feature of these forms of public 
services is the possibility of close substitution with private consumer 
spending. 

The second type of service is an input to private production pro- 
cesses, which can apply either to businesses or households. Examples 
include the provision of a legal system, aspects of national defense, 
fire and police services, education, and various regulatory activities. 
(The last item is likely to exhibit negative marginal product.) In some 
cases these services would be close substitutes for private inputs of 
labor and capital. However, in areas like the provision of a legal 
system and national defense, the public services are likely to enhance 
the marginal products of private factors. 

In many situations a particular government activity would exhibit 
features of both general types of public services that are being consid- 
ered. The extent to which each feature was represented would vary 
across a wide range of programs. Despite this real world diversity, the 
formal analysis proceeds as if there were a single type of govern- 
mental activity, which has service attributes that are partly of the 
direct-utility type and partly of the productive-input type. 

Government-provided services are often modeled as “public,” as 
opposed to private, goods in the sense of being “nonrival”—one 
person’s enjoyment of the good does not diminish the enjoyment by 
another person.‘ It is doubtful that this characteristic applies to the 


* This result does not depend on an infinite horizon for the representative household. 
Finite lives can alter the effects of some government actions, such as changes in public 
debt or social security, that involve a publicly mandated shift in incomes across genera- 
tions. Government purchases that are financed contemporaneously by taxes (which are 
independent of age) do not involve these considerations. 

*This characteristic is embedded in the theoretical analysis of Samuelson (1954). 
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majority of government purchases at the current time. Falling 
outside of this category would be the bulk of expenditures on educa- 
tion, hospitals, school lunch programs, and any service that is subject 
to congestion, such as parks, courts, libraries, transportation and 
highway projects, and police/fire services. Even in the case of national 
defense, the benefits to individuals are likely to be relative to the total 
amount of property that is being defended, because the level of 
external threat would respond to the potential prize from conquest 
(see Thompson 1974). 

In a nonrivalry situation, individual utility or production would 
depend on the total of government services rather than on the quan- 
tity provided to the particular economic unit. Because nonrivalry 
seems atypical, the modeling assumes that individual utility and pro- 
duction depend on real government purchases per capita. However, 
the general form of the analysis would not be altered appreciably if 
some elements of nonrivalry were introduced. 

Consider, first, the direct interplay in utility functions between 
government services and household choices of consumption and lei- 
sure. Suppose, as stressed by Bailey (1971, chap. 9), that the contem- 
poraneous levels of (per capita) G; and (individual) C; are close sub- 
stitutes in utility terms. For example, assume that each unit of G; (per 
capita) is viewed as providing utility services that are equivalent to a 
fraction 6 of a unit of contemporaneous individual consumption 
expenditure.® That is, household utility depends on the effective 
consumption flow at each date, C# = C; + 0G,;, where 0 < 0 < 1. The 
formulation neglects this type of utility substitution among noncon- 
temporaneous values for C and G or between G and leisure.® The 
provision of these types of public services means that households 
obtain units of effective consumption, C*, that exceed the quantity of 
private real expenditures, C. The permanent flow of government pur- 
chases, G, can be used to finance the uniform effective consumption 
flow, 0G.” This aspect of public services offsets the negative perma- 
nent income effect from G that was described earlier (see also n. 6 
above). The permanent income effect that is pertinent to private 
choices on consumption expenditure and leisure is now (@ — 1)G— 
the condition 0 < @ < 1 implies that the permanent income effect of G 
is still less than or equal to zero, but no larger than one in magnitude. 


> The parameter 6 can be viewed in the following analysis as applying to the marginal 
unit of G, 

6 The time path of G, could also affect overall household utility in a form that was 
additively separable from the time paths of effective consumption and leisure. In this 
sense the 9 parameter need not limit the utility value that households attach to public 
services. This possibility does not invalidate the subsequent discussion of permanent 
income changes that are induced by shifts in G. 

7 The discussion assumes that the inequality constraint, C,; > 0, is never binding. 
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With G held fixed, an increase in G, now implies some direct 
crowding out of contemporaneous private commodity demand. In 
order to maintain the level of effective consumption, C#—which is 
appropriate when G, the other determinants of permanent income, 
and r are held fixed—C? must decline with G; in accordance with the 
parameter 9. The greater the utility substitution (at the margin) be- 
tween C,; and G,—as measured here by 6—the larger the negative 
response of C# to an increase in G,. As long as 0 < 1 applies, aggregate 
commodity demand for date t—Y¥? = C# + G,—rises as the nonnega- 
tive fraction, (1 — 6), of increases in G, (when G is held fixed). 

Consider next the role of public services as an input to private 
production processes. It is assumed that public services of this type 
have a positive marginal product, which is denoted by MPG. The 
condition MPG < 1, which is assumed to hold, implies that the 
marginal response of (aggregate) private output to an increase in 
(aggregate) G does not exceed the social cost of providing the extra 
public services input. Through its role as a productive input, an 
increment in G raises commodity supply, Y%, for given levels of private 
factor inputs. Note that part of total output will be utilized to provide 
intermediate goods, which take the form of publicly supplied pro- 
duction inputs. Although there are good reasons in principle for 
deleting these intermediate goods from measures of final product, 
this approach is not followed in the national accounts.8 In the 
theoretical analysis the output measure, Y, is also gross of this type of 
intermediate production. 

Because the public-service inputs are provided freely, a change inG 
alters private sector real incomes in accordance with the marginal 
product, MPG. The representative household receives a per capita 
share of this extra real income. This effect further offsets the inverse 
influence of G on permanent income. The net effect now depends on 
the term (9 + MPG — 1), which is nonpositive but no greater than one 
in magnitude if 0 < 6 + MPG < 1 applies.® 

If variations in G alter the marginal products of private productive 
inputs, then additional effects would arise through changes in factor 
demand functions. Cases where public services substitute for private 
inputs—such as the provision of public rather than private 


* These matters are discussed in Kuznets (1948, pp. 156-57) and Musgrave (1959, 
pp. 186-88). The double-counting property for publicly provided production inputs 
implies that empirical counterparts of total output, like real GNP, overstate the re- 
sponse of final output to government purchases. 

* Recall that the analysis deals with a composite government service that has attributes 
of the direct-utility (6) and production-input (MPG) type. A particular category of 
purchases is unlikely to exhibit a high value for both parameters, 0 and MPG. There- 
fore, if 6 < 1 and MPG < 1 apply for each category of purchases, the condition @ + 
MPG §< 1 is likely to hold for the composite over all categories. 
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guards—would generate reductions in the marginal product of labor. 
On this count, the private demand for labor would tend to fall when G 
rises. In some other cases the private demand for capital services 
would decline. However, to the extent that background services like 
national defense and a legal system are expanded, factor marginal 
products are likely to rise, which would generate the opposite re- 
sponses in private factor demands. The main analysis neglects the 
array of possible effects of government purchases on private factor 
marginal products. 


Commodity-Market Clearing 
The equilibrium condition for the commodity market is given by 


Vl Gee ane. GidnG: eG <a, 7G. GY (3) 
(es) i) :(=s) (Ct) a te) 

Recall that the analysis has neglected the investment component of 
demand and has, thus far, assumed lump-sum taxation. Time sub- 
scripts have been omitted for convenience. Signs beneath the inde- 
pendent variables refer to partial derivatives. The omitted arguments 
in the C4 and Y* functions involve various fixed aspects of households’ 
permanent incomes, the production technology, and so on. As men- 
tioned before, the real rate of return, r, exerts intertemporal substitu- 
tion effects that are negative on C% and positive on Y*. A rise in G has a 
negative crowding-out effect on the contemporaneous choice of C%. 
The impact of G on Y* is nonnegative if MPG 2 0. A rise inG reduces 
the pertinent measure of households’ permanent incomes, as dis- 
cussed before, which leads to a decrease in C4 and a rise in Y*. 


Effects of a Temporary Rise in Government Purchases 


Consider a temporary expansion of real government purchases, 
where G rises while G is held fixed. The budget condition from 
equation (1) implies that these purchases are financed by a contem- 
poraneous increase in real taxes-net-of-transfers, T. In fact, for the 
context of a temporary rise in government spending—which is most 
pronounced at the federal level during wartime—it is more natural 
that the bulk of contemporaneous finance would take the form of 
interest-bearing debt issue rather than tax increases. This behavior 
allows the government to spread the higher taxes necessitated by 
temporary spending over a large time interval instead of implement- 
ing exceptionally high tax collections for a few periods (see Barro 
[1979, 19806] for discussions). With G held fixed, it would be possible 
to utilize interest-bearing debt so as to maintain the entire initial time 
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path of net real tax collections, T;. In any event, for the setting of 
lump-sum taxes, the present value of tax obligations is not altered by 
shifts between public debt and taxes (see n. 2 above). Since house- 
holds’ calculations of permanent incomes depend only on this 
present-value magnitude, the effects of increases in government pur- 
chases that are financed by debt issue would coincide with the effects 
of those that are financed by higher taxes. 

The rise in G reduces C¢ on the left side of equation (3) in accor- 
dance with the utility substitution parameter, 0. Therefore, aggregate 
demand, Y%, rises on net by (1 — 6) times the increase in G. If the 
marginal product of public services, MPG, is positive, Y* rises with G 
on the right side of equation (3). Overall, the shift in “excess de- 
mand,” Y¢ — YS, is determined by the term (1 — @ — MPG). It was 
already assumed that this term is nonnegative—that is, the direct, 
one-to-one effect of G on aggregate demand is offset only partially by 
the utility-substitution and productive-input aspects of government 
purchases. 

Since an increase in G raises excess commodity demand, a rise in the 
real rate of return is required in order to restore commodity-market 
clearing.’ This response in r reduces C@ and raises Y’. The rise in Y* 
reflects the substitution of current work effort for planned future 
effort. Since Y* was also increased directly by the rise in G, it is 
apparent that equilibrium output rises. This output effect is greater 
the smaller is the value of 6, the larger is the value of MPG, and the 
greater is the real interest rate elasticity of Y* relative to that of C4. In 
the polar case where 6 = 1 and MPG = 0, the response of Y and r toG 
would be nil. In this circumstance, government purchases would 
amount to lump-sum transfers to households, because G,and C, were 
perfect substitutes in the utility function. 

Private consumer spending, C“%, is crowded out from the rise in r 
and from the initial negative effect of G. Therefore, the positive 
response of Y to G must be less than one-to-one—that is, the model 
exhibits an output dampener rather than a multiplier. 

The positive response of output to temporary movements in gov- 
ernment purchases would apply especially to wartime periods.!! The 
higher real rate of return can be viewed as a price signal that induces 
the intertemporal substitution of resources toward periods such as 
wars in which aggregate output is valued unusually highly. This type 


10 Because the expansion of G is temporary, the rise in the equilibrium real rate of 
interest would also be temporary. An extension to allow divergences between current 
and expected future real rates of return does not alter the basic analysis. 

* Wartime may also be associated with uncertainties on maintaining property rights, 
which would tend to reduce private investment demand. The possibly changing proba- 
bility of winning or losing a conflict would enter in this context. The analysis abstracts 
from these effects and from controls on prices or interest rates. Also excluded are 
effects of patriotism or coercive behavior, such as conscription. 
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of substitution has been stressed by Hall (1980, Sec. 2), who points out 
also that this behavior differs in some important respects from the 
responses of supply to monetary misperceptions that occur in some 
business cycle theories that stress intertemporal substitution on the 
supply side (e.g., Lucas and Rapping 1969; Lucas 1975; Barro 1980a). 
The effect of temporary government purchases on the time arrange- 
ment of work and production does not rely on elements of misper- 
ception with respect to the general price level or other variables. 


Effects of a Permanent Rise in Government Purchases 


Suppose now that G and G rise by equal amounts. The increase in G 
was shown already to raise excess commodity demand in accordance 
with the term (1 — 6 — MPG). A higher value of G was shown earlier 
to reduce effective permanent income by the same factor. It was also 
noted above that the typical or intermediate response of C? net of Y* 
would be one-to-one with this type of change in effective permanent 
income. (This response arises when households reduce expenditures 
on consumption and leisure by the same amount in each period.) In 
this case the response of (C4? — Y*) equals —(1 — 9 — MPG) times the 
change in G. Since this response exactly offsets the excess demand 
effect of G, the overall response in (Y? — Y%) would be zero. It follows 
in this case that the real rate of interest is invariant with permanent 
changes in government purchases (under lump-sum taxation).” Al- 
though the invariance of r would not hold in general, this conclusion 
for an intermediate case contrasts with the presumption of a positive 
effect for the setting of a temporary increase in government pur- 
chases. 

When r is unchanged, it follows from equation (3) that consumption 
falls, and total output rises. The expansion of production reflects 
partly the direct effect of G on Y* (which was assumed to be positive) 
and the negative income effect of G on leisure. (The net response of 
leisure becomes ambiguous when income taxation 1s introduced—see 
the subsequent analysis.) The decline in consumption means that 
output rises by less than one-to-one with the permanent expansion of 
government purchases; that is, an output dampener is again pre- 
dicted. 


Non-Lump-Sum Taxation 
Some of the results are affected by the unrealistic assumption that 


government expenditures are financed by lump-sum taxation. This 


Tt follows also—assuming no direct effects on the marginal product of capital 
schedule—that capital accumulation would be unaltered. 
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section explores the consequences of income taxes in a simple envi- 
ronment. Because the present analysis is concerned only with tax 
effects that are systematically related to changes in government 
purchases—rather than with public finance questions, per se—this 
simplified analysis may be adequate. 

Suppose that government expenditures are financed by a general 
income tax. Let 7; represent the effective tax rate on incomes that 
accrue during period ¢.'* In a nonproportional tax setup, 7, would 
represent the average marginal income tax rate. Assume now that the 
government can borrow and lend at the real rate of interest, y—the 
same rate that applies to the private sector. Given the permanent flow 
of real government spending, which includes G and the comparable 
measure for real transfers,“ and the possibility for variations in public 
debt, there exists some income tax rate 7 that is constant over time and 
also just satisfies the government’s intertemporal budget constraint. 
Although many patterns of time-varying tax rates would also satisfy 
the government’s budgetary requirements, it will be desirable in most 
circumstances '!*&—in terms of minimizing the distortions that are im- 
posed on the private economy—to stabilize income tax rates over 
time. That is, the government would adjust its public debt issues and 
redemptions in order to prevent divergences between current and 
expected future income tax rates. (For discussions of this type of 
result, see Barro [1979, 1980c]; Kydland and Prescott [1980, pp. 
185—86].) In this setting changes in government purchases would not 
generate movements in current tax rates relative to expected future 
rates. This conclusion means that, first, variations in G with G held 
fixed have no effect on tax rates; second, shifts in G imply equal 
changes in current and expected future tax rates, 7;17 and third, for 
the purpose of studying government purchases, it is unnecessary to 
deal with the intertemporal substitution effects that would arise from 
expected time variations in income tax rates.18 

Given this framework for tax rate determination, it is unnecessary 
to modify the main conclusions that were derived earlier for the case 


*’ The analysis neglects the double taxation of incomes that flow through the corpo- 
rate sector. Taxation in the form of inflationary finance could be included separately 
without affecting the main results. 

‘4Real interest payments on an initial public debt stock would also enter. 

® Because changes in tax rates affect the tax base, Y, the solution for 7 is generally 
nonunique. However, the minimal possible value for 7 is the pertinent choice. 

“6 It may be optimal to allow tax rates to vary over the business cycle. A countercycli- 
cal pattern shows up empirically for the U.S. federal government. However, the pattern 
is at least less pronounced in terms of the total government sector. 

*’ It is assumed that increases in tax rates induce increases in real tax revenues within 
the relevant range. 

8 E.g., these types of effects are central to a study of the investment tax credit (see 
Kydland and Prescott 1977, pp. 482-86). 
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of a temporary increase in government purchases. Notably, output 
and the real rate of interest continue to rise when G increases, with G 
held fixed. The principal modification to the previous findings is that 
r should be interpreted as the after-tax real rate of return, as calcu- 
lated with the appropriate average marginal tax rate on interest in- 
come.?9 

For the case of a permanent increase in government purchases, the 
new element is the rise in the income tax rate, 7, along with the rise in 
G (see n. 17 above). The higher income tax rate motivates a shift away 
from market work and toward leisure and other nonmarket activities. 
In equation (3) this change is reflected as downward shifts in the Y’ 
and C“ functions.” For given values of r (now interpreted net of tax), 
G, G, and so on, it is plausible that the declines in C4 and Y* would 
roughly balance. In other words, as in some cases that were discussed 
earlier, there is no reason to expect a particular direction of change 
for the relative amounts of consumption and leisure expenditures 
that are conducted at different dates. 

Given this pattern of response to a higher income tax rate, it still 
follows that a permanent shift in government purchases has no effect 
on the (after-tax) real rate of return. However, the negative effect of 
higher income taxation on the incentive to work offsets the tendency 
for output to rise.” , The net movement in output now involves three 
forces: first, the substitution away from work because of higher in- 
come tax rates; second, the negative income effect (associated with the 
higher level of G) on leisure, which motivates more work; and third, 
the direct productive-input effect of G on Y*. The first two influences 
involve the standard ambiguous net response of leisure to the sub- 
stitution and income effects that are generated by either a change in 
the real wage rate or a shift in the income tax rate that is applicable to 
labor income. However, the income effect here involves the term —(1 
— 6 — MPG)G, while the tax shift applies one-to-one to G. This dif- 
ference increases the likelihood that the substitution effect will out- 
weigh the income effect. Suppose that the substitution effect were, in 
fact, comparable to or dominant over the income effect. In this case 
the overall change in output that is induced by a permanent rise in 


19 This result assumes that interest income is taxable and interest payments are 
deductible from taxable income. The conclusion neglects systematic differences be- 
tween the marginal tax rates applicable to receivers of interest income vs. those perti- 
nent for payers. The result is not affected directly by the taxation of nominal, rather 
than real, interest payments. However, other effects of inflation on effective tax rates 
would matter. 

20 The incentive to accumulate capital would also be diminished. ‘Through this chan- 
nel, a permanent increase in government purchases tends to reduce the capital stock, 
even when the after-tax rate of return is unchanged. 

21 A reduction in the capital stock, as mentioned in n. 20, reinforces this effect. 
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government purchases would be bounded from above by the direct 
positive effect of G on Y*. This channel corresponds to only one 
portion of the positive output response that arose for the case of a 
temporary increase in government purchases—the other part in- 
volves the intertemporal substitution effect on work effort, which is 
associated with the increase in r. It follows that temporary rises in 
government purchases would induce larger output responses than 
equal size, but permanent, rises in purchases. The sign of the output 
response is now ambiguous for the case of a permanent expansion of 
government purchases—the reaction is more likely to be positive 
when the marginal product of public services is high. 

Overall, temporary expansions of government purchases are dis- 
tinguished from permanent increases in that, first, the positive effect 
on output of the temporary expansion is likely to be larger and, 
second, a positive effect on the (after-tax) real rate of interest is pre- 
dicted only for the temporary case. The present empirical investiga- 
tion deals only with output effects of temporary versus permanent 
movements in government purchases. Some preliminary analysis of 
real-rate-of-return effects is carried out in Barro (1981a). 


II. Empirical Implementation 


The theoretical propositions will be tested by examining the effects of 
government purchases in a reduced-form relationship for output, as 
measured by real GNP. The analysis is an extension of previous 
empirical research (Barro 19816), which stressed the business cycle 
influences of monetary disturbances. This earlier work included a 
government purchases variable, but did not distinguish temporary 
from permanent government spending. 

It is convenient to carry out the analysis in terms of the ratio of real 
government purchases to real GNP, G/Y. In particular, temporary or 
permanent variations in G are assumed to enter relative to Y in a 
linear relation for the log of output—that is, 


log(Y,) =... + B(G —G)/Y], + B,(G/Y),, (4) 


where omitted variables indicated by . . . include current and lagged 
monetary shocks and other deterministic and stochastic influences on 
output. The variable G,, which would generally be unobservable, is 
the permanent flow of government purchases as perceived at date t. 
The empirical procedure for handling this variable is discussed below. 
Lagged values of G/Y and G/Y might also appear in equation (4), but 
these effects were not found to be important empirically. 

The functional form in equation (4) implies that increments in 
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government purchases induce increments in output in accordance 
with 


AY We piG — GIVE B.GIV]= Ba(G = G) + pdG. (5) 


Therefore, in regions where [B,(G — Cy a BGI] is small relative to 
one, the coefficients 8, and 8, would indicate the approximate re- 
sponse of Y to unit changes in (G — G) and G, respectively. The 
empirical results suggest that this approximation is satisfactory over 
the 1930-78 period in the United States, except for the World War II 
years. 

The theoretical analysis suggests testing the hypothesis, 0 < B, < B, 
=< 1. This restriction implies that temporary changes in government 
purchases have a larger output effect than permanent changes, but 
that permanent changes also raise (measured) real GNP. These con- 
ditions are likely, but not inevitable, implications of the theoretical 
model. 

A central aspect of the empirical analysis is the representation for 
permanent real purchases as perceived at date t, G;. Suppose for the 
moment that the time-series behavior of G; implies a relationship for 
(G/Y), in terms of a set of parameters a and a vector of currently 
observed variables, Z;: 


(GIY), = F(Z¢, @). (6) 


In this case the unobservable construct, G,, could be substituted out 
from equation (4) to yield a relation for output in terms of observable 
variables and the vector of unknown coefficients (a, 81, B2), 


log(Y,) =... + Bil(G/Y), — F(Zy a)] + BoF (Ze a). (7) 


Some hypotheses arise that concern the role of the Z; variables in 
equations (4), (6), and (7). If these variables can be guaranteed, ex 
ante, not to appear separately in the list of omitted elements that are 
denoted by . . . in equation (4), then the Z; variables would appear in 
equation (7) only to the extent that they serve as determinants for 
(G/Y), in the F-function of equation (6). Some cross-equation restric- 
tions therefore emerge for the parameters of equations (6) and (7). 

The next sections deal with the problem of modeling a form of 
equation (6) for real government purchases in the United States.” 


22 Levis Kochin has suggested the attractive alternative of using the current overall 
tax rate as a proxy for the anticipated, long-run average ratio of government purchases 
to GNP. The rationale for identifying the current tax rate with the anticipated govern- 
ment expenditure ratio was discussed in the theoretical section. Some problems with 
implementing Kochin’s suggestion are: First, the distinction between purchases and 
expenditures implies that a separate model would be required to predict future trans- 
fers (including interest payments), which is not obviously easier than modeling pur- 
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The stress on transitory movements in government purchases 
suggests special attention to war-related expenditures, which are 
likely to be viewed as largely temporary. I have proceeded empirically 
by separating total government (federal plus state and local) real 
purchases of goods and services into a “defense” component, G”, and 
other purchases, G”. The present analysis does not attempt to classify 
components of government purchases in accordance either with their 
relative substitutabilities with private spending, as reflected above in 
the 6 parameter, or with their role as inputs to private production, as 
measured above by the MPG parameter. Differences between defense 
and nondefense items with respect to these parameters affect the 
interpretation of some of the empirical findings. Presumably, defense 
purchases are characterized by a relatively low value of 6 and possibly 
by a relatively high value of MPG. The former implies a relatively 
large output effect of temporary defense purchases, while the latter 
would enhance the output effects of both temporary and permanent 
defense purchases. The empirical analysis would be sharpened by 
obtaining a division of nondefense purchases into relatively homo- 
geneous categories with respect to the @ and MPG parameters, but 
the feasibility of this classification is unclear. Transfer payments 
have not been included in the analysis. 


Defense Purchases 


A primary determinant of GY would be the level of current and 
anticipated future wartime activity, assuming that at least the timing 
of wars can be treated as exogenous with respect to expenditure 
decisions. I have quantified this influence by using a casualty rate 
measure B,, which represents battle deaths per 1,000 total population 
(see table 1) for the wartime years since the Civil War: 1898, 1917-18, 
1941-45, 1950-53, 1964-72. In effect, this variable can be viewed as 
an alternative to a set of wartime dummy variables. The casualty rate 
measure represents an attempt homogeneously to quantify the inten- 
sities of different wars and different years within each war, without 
using military expenditures or personnel measures, which are the 
types of variables that are to be explained. In particular, the use of 


chases directly; and, second, the use of the tax rate to proxy the permanent expendi- 
ture ratio may work better for the federal government than for total government. See 
Benjamin and Kochin (1978), who argue that mobility possibilities would prevent state 
and local governments from choosing an excess-burden-minimizing debt policy. How- 
ever, this issue involves also the federal government’s interaction with state and local 
governments—that is, the federal government may compensate for public debt/tax 
variations that cannot be carried out at the state and local levels. 
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TABLE 1 


CASUALTY RATE VARIABLE 











Date B Date B Date B 
L8QSSe on cee -0052 W945 Gece scea,c .603 ROGGR er ace: .025 
(Ol 7 Fee 123* TOS OM RARE. a .071 L9G}: .047 
LODSpe Stee .28* 1 OS eee Re .097 NOGS MhAre..e .073 
1O4 Me ek .0044 OS Oi ates .030 TOC OME ek 2c -046 
194 2 Perec oy TO 53 eee oe .021 O70 Mees: .021 
TOSS) Sees <2 .205 LOGS FES os .0014 197 | eee .0067 
LO44 rretiscepe 1.090 T9OG5): Byoeteee .0070 1972. os? -0014 





Sources.—Vietnam (1964-72): Statistical Abstract of the United States, 1977, p. 369, table 590. World War I 
(1917-18) and Spanish American War (1898): Historical Statistics of the United States, 1975, p. 1140, line 880. Korean 
War (1950-53): relative yearly data from Department of the Army, Battle Casualties of the Army (1954), were applied to 
war total from Statistical Abstract of the United States, 1977, p. 369, table 589. World War II (1941-45): relative yearly 
data from Office of the Comptroller of the Army, Army Battle Casualties and Nonbattle Deaths in World War II: Final 
report, December 1941—December 1946, were applied to war total from Statistical Abstract of the United States, 1977, p. 369, 
table 589. Korean War and World War II data were obtained from William Strobridge, Chief, Historical Services 
Division, Department of the Army. 

Nore.—B is battle deaths per 1,000 total population. Values of zero apply to dates not listed. Orders-of-magnitude 
values of B (per year) for earlier wars are: Revolution (1775-83), 0.2; War of 1812 (1812-15), 0.08; Mexican War 
(1846-48), 0.04; Civil War (1861-65, union only), 1.0. Casualty figures are from: Civil War: Historical Statistics of the 
United States, 1975, p. 1140, line 880; other wars: Department of the Army, History of Military Mobilization in the United 
States Army, 1955, appendix A. 

* Yearly data were unavailable. Figures are based on war total assuming equal rate of casualties per month. 


separate dummy variables for each war would remove any power 
from the statistical tests that are carried out below. 

Because of improvements in the technology of caring for wounded 
and offsetting changes associated with the “efficiency” of weapons, it 
is possible that the casualty rate variable does not consistently measure 
the intensity of war at different dates. I considered using a broader 
casualty measure that included wounded, but the ratio of this concept 
to battle deaths showed no trend at least since the Spanish American 
War.” Since I was unable to obtain reliable annual data on wounded 
for World War II, I have restricted my analysis to the narrower battle 
deaths concept of casualty rates. 

Prospective wars would be likely also to influence current spending, 
with good information on forthcoming military actions existing prior 
to at least the U.S. entrances into World Wars I and II. Since I have 
been unable to construct any instruments for these war expectations, I 
have introduced some actual future values of B into an equation for 
current defense spending. This procedure introduces errors-in- 
variables problems into coefficient estimation, although the present 
analysis is concerned primarily with obtaining conditional forecasts 
rather than with coefficient estimation, per se. A later part of the 


3 The ratio of total casualties (including wounded, but excluding deaths that were 
unrelated to combat) to battle deaths is 5.3 for the Spanish American War, 4.8 for 
World War I, 3.3 for World War II, 4.0 for the Korean War, and 4.3 for the Viet- 
namese War. See the notes to table 1 for sources of casualty data. 
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empirical analysis considers a specification where future values of the 
B variables are excluded from the estimation. Lagged effects of B on 
spending are introduced also into the equation for defense purchases. 
Empirically, two annual leads, the contemporaneous value, and up to 
a third lag of the B variable were found to be important. 

Since defense expenditures involve a substantial investment com- 
ponent, the amount of current spending would tend to be influenced 
negatively by the size of existing capital stocks. Accordingly, I have 
included in a defense spending equation the variable K¥%,, which 
measures the beginning-of-period real stock of military equipment, 
structures, and inventories (table 2, col. K”). The relation of capital 
stock to current spending is assumed to be given by 


Kee OGr tele 10) Keay, (8) 


where 6 is a depreciation rate and b measures the fraction of total 
defense spending that constitutes investment (net of within-year de- 
preciation on this investment). The K” series was constructed with 
values of b and 6 that varied over time (see the nn. to table 2), but I have 
limited the theoretical discussion below to situations where these 
parameters are approximated satisfactorily as constants. 

The estimating equation for GY takes the form 


ef = (GYIY); = a By Fo + a Beg FG Bi FOB as — yhe a + Ui; 
(9) 
where G?’ is real defense purchases, Y; is real GNP, k¥?, = TON ee 
and u; is a stochastic term. Note that the dependent variable in 
equation (9) is g¥’, the ratio of real defense purchases to real GNP. The 
main part of the subsequent analysis is carried out in terms of ratio 
variables of this type. The form of equation (9) implies that a doubling 
On Ke! and Y 2 78tor given values of the B variables, leads to a 
doubling of GY. The model for determining g?’ over time will be used 
to determine the currently perceived permanent flow of real pur- 
chases when expressed relative to real GNP, gY = (G4Y),. 
The error term in equation (9) was modeled satisfactorily as a 


random walk, so that estimation can be carried out readily in first- 
difference form: 


Dep =a DB, +... + asDBi_, + a,DBig + GeDB i142 — yDRY. + &, 
(10) 
where D is the first-difference operator and €, = u; — u;_; is a white- 
noise error term. A constant is insignificant when added to equation 
(10) in the empirical analysis—that is, there is no trend in the defense 
purchases ratio. Moving-average error terms or more complicated 


autoregressive error structures also did not add to the explanatory 
value of the equation. 
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The form of equation (10) implies that a current shock e,—which is 
not associated with wartime in the sense that the values of the DB 
variables are small—would have a permanent effect on the future 
mean level of g”. Because of the inclusion of the k”%, term with a 
negative sign in equation (10), the effect on g? of e—that is, of the 
current actual value g} with values of the B variables and k”, held 
fixed—turns out to be positive, but somewhat less than one-to-one. 
On the other hand, if the distribution for future values of B is 
stationary in level form, positive values for DB tend to be followed at 
later dates by negative values, which lead to decreases in future values 
of g”. In other words wars and the accompanying levels of expendi- 
tures are modeled as temporary. This mechanism implies that an 
increase in g?’ that is accompanied by positive values of DB will have 
much less effect on g? than will the equivalent peacetime change in de- 
fense spending. Wartime spending has an appreciable effect on g¥ 
only to the extent that military expenditures depart from the amount 
associated typically with the current level of war intensity. The possi- 
bility that wars have a systematically important permanent effect on 
the purchases ratio is ruled out by the form of equation (10). Some 
alternative specifications of the error process that would have admit- 
ted this type of persisting effect were not supported empirically for 
the United States. 

As detailed in the Appendix, equations (10) and (8) can be used to 
express expected future values of the defense purchases ratio, g}4;, as 
a function of the latest observed ratio g?, the value k# ,, and actual and 
expected future values of the war intensity variable, B. Summation 
over these expressions with proper allowance for discounting yields a 
relation for the permanent purchases ratio g? in terms of gv’, ki”,, the 
array of B variables, and a real discount rate p (which equals the dif- 
ference between the real interest rate and the growth rate of real GNP). 
Finally, a simple specification for the stochastic structure of the B 
variable—based on the frequency, intensity, and duration of wars 
over the full history of the United States—is used to solve for the 
expected future values of the B variables. With this substitution, g? is 
determined as a function of observable magnitudes, up to the setting 
of a discount rate. In the case where future values of the B variables 
are admitted into the government purchases equation (10), er ends 
up as a function of the values Bj45, . . . , By_3— that is, the values Bj,, 
and B,,,. are treated as observable at date ¢. Ina situation where future 
values of B are excluded from equation (10), g?’ is expressed in terms 
of the current and lagged values, B;,, . . . , By_3 (as well as the values of 
g? and k?,). 

The main product of this exercise from the full empirical analysis is 
the series for temporary real defense purchases expressed relative to 


TABLE 2 


GOVERNMENT PURCHASES VARIABLES 











Date gue et g’ — g” ket grsg 
1889 0060 mee Bete Sake .097 
1890 0057 ee: ae re .094 
1891 0057 a sg re .094 
1892 .0056 to ae aee: 089 
1893 .0062 situf S56 iia .097 
1894 .0064 abi eer a: 102 
1895 0053 a ne ye .092 
1896 .0059 ae Sat hte .097 
1897 0080 mee i 31 091 
1898 .0192 ee eee Pal .094 
1899 .0165 ae: ec ht 087 
1900 0131 bas 766 Sys .088 
1901 .0113 eet ext mete 081 
1902 .0110 bass MS a8 084 
1903 0108 ee aN: 18 .088 
1904 .0123 aes: BS hae .088 
1905 BO sues aie we .089 
1906 .0093 nae ae Mes 082 
1907 .0090 were sa: 2% 091 
1908 0117 mes ae Be .110 
1909 .0105 ae sce aes 089 
1910 .0100 oe FW bem 091 
Oe 0101 Ge na ares 105 
ene .0094 os: oo eke .100 
1913 .0096 ae ets ae .096 
1914 .0139 ae net oe .106 
1915 .0135 Sf: Ae se ea 
1916 .0164 ae Sew nee .093 
1917 .076 sts Siete see 085 
1918 .258 re bes ie 080 
LOE .156 BOC fe: oer .049 
1920 .038 oe Ba ease 085 
1921 033 nee aie tts 25 
1922 O17 ohare ees Eek 116 
1923 014 ay sis Bt 105 
1924 Olt Soe ees a: 116 
1925 012 ae ae bea 115 
1926 O11 ah wie st .108 
1927 012 us spit ae 118 
1928 013 a oo woe aL 
1929 013 ae as 055 stil 
1930 O15 030 —.015 056 141 
1931 O17 031 eA 057 .158 
1932 O19 .033 S014 .063 75 
1933 016 .032 =LOUG .063 .176 
1934 016 2032 ONG 056 187 
O35 017 031 et ee 050 174 
1936 .018 031 OS, 045 ao) 
1937 016 029 OES .043 165 
1938 019 031 = .046 .187 
1939 017 029 a Ol2 045 .183 
1940 028 025 003 052 .163 
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TABLE 2 (Continued) 











Date eye oe gt = ge kwt ers 
194] pE20 .065 .055 .074 22 
1942 Solty .104 a2 .164 .092 
1943 439 .093 346 .309 .069 
1944 .463 .074 389 431 .064 
1945 .410 .069 341 470 .063 
1946 .103 .063 .040 483 .092 
1947 .055 .069 —.014 414 .106 
1948 .056 .072 =.017 BOO. mley7i 
1949 .064 .091 —.026 .290 alo 
1950 .066 .074 —.008 .239 lity 
1951 nS .098 .026 .209 LOM 
1952 .156 ela .030 .225 SG 
1953 .156 .140 .016 .260 118 
1954 2133 4132 .001 291 119 
1955 5 lay ae el .284 .116 
1956 pa 2 2129 a7 Olu .283 males) 
1957 .116 3133 Olea .286 mg 
1958 .115 2133 Sees .296 .134 
1959 .108 129 eae .286 129 
1960 LO? .124 ae .282 2133) 
1961 .104 .124 —.021 .278 138 
1962 .103 1) {020 .268 .138 
1963 .096 .116 nL .263 .142 
1964 .087 .107 Seal 251 145 
1965 .080 .096 —.016 a2 3'5 147 
1966 .088 .090 —.002 219 .146 
1967 .098 .086 O11 216 .149 
1968 .096 .081 .014 ol 151 
1969 .088 .080 .009 .206 .150 
1970 .079 .083 — .004 .201 .154 
1971 .068 .080 eae li alii 157 
1972 .063 .076 —.014 .169 .153 
1973 .056 .074 —.017 LSS .148 
1974 .055 .071 —.017 147 Lb, 
1975 .055 .070 —s016 14] .164 
1976 .050 .067 = PAUES .128 .156 
1977 .049 .064 OES 118 2Lb2 
1978 .046 .060 —.014 mu) .150 





* g” = G“/Y, where ¥ is real GNP (1972 base). G” is real defense purchases (1972 base). Data since 1929 are from 
National Income and Product Accounts of the United States and recent issues of the United States Survey of Current Business. 
The fraction of nominal defense purchases in total nominal federal purchases was multiplied by figures on real 
federal purchases (1972 base). Data from 1889-1928 are from Kendrick (1961, table A-I, col. 5). Figures were 
multiplied by 4.8, based on the overlap for 1929. 

+ G* is the estimated normal defense purchases ratio, as calculated from eq. (14) in the text. 

+h” =K~/Y, where K “ is the end-of-year value of net real stocks of military structures, equipment, and inventories 
(1972 base). Data from 1929-69 are from Kendrick (1976, table B-24) converted from a 1958 to a 1972 index by a 
constant multiple (1.72). Figures were extended to 1978 using data on various expenditure components: military 
structures, AEC structures, military equipment, AEC equipment, inventories for GSA stockpiles, and inventories for 
AEC stockpiles. Depreciation estimates were based on rates used by Kendrick within each category. His calculations 
assume a higher rate of depreciation during World War II. 

§ g? =G?/Y, where G” is real nondefense purchases of the federal plus state and local government sectors (1972 
base). G” was calculated as total real government purchases G less G”. Sources for G correspond to those above for 
G”, except that Kendrick (1961, table A-Ila) was used for data from 1889-1928. 
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real GNP, (g” — g”),, which is indicated over the 1930-78 period in 
table 2. (This series corresponds to the choice of discount rate—the 
difference between the real rate of interest and the growth rate of real 
GNP—of .02 per year.) As is evident from the table, this variable 
identifies the years associated with wartime, 1940-46, 1951-53, 
1966-69, as times when the defense purchases ratio is above its 
perceived long-run average value. Although the underlying model 
allows a quantitative assessment of the gap between g? and g? in the 
context of wars with different intensities and in an environment of 
nonconstant values for g?, it is also clear that the general pattern for 
the (g” — g”), variable would be robust to some changes in the 
underlying model. Notably, the important aspect of the stochastic 
specification for the war intensity variable B is the temporary nature 
of wars, rather than the details of war probabilities. Substantial varia- 
tions in the discount rate p also have minor effects on the results. 

It should not be surprising that the (g” — g®), variable exhibits a 
substantial amount of positive serial correlation. In this respect a gap 
between current and normal values—which the (g” — g”), variable is 
intended to capture—should be distinguished from the spread be- 
tween actual and anticipated or perceived amounts, which has been 
stressed in earlier analyses of monetary disturbances (Barro 19816). 
The latter type of variable exhibits serial independence as a conse- 
quence of rational expectations and the assumption that information 
is received with, at most, a one-period lag. This type of argument does 
not apply to a variable that measures temporary effects. In the case of 
the temporary defense purchases variable, the large number of (se- 
rially correlated) peacetime years with small negative values of (ease 
g”), is offset by a smaller number of (serially correlated) wartime 
years with excesses of g? over g?. (However, the years that are 
significantly affected by war—for example, the set 1941-46, 
1950-53, 1965—~72—should not be deemed special, since they con- 
stitute 47 percent of the years since 1941 and 39 percent of those since 
1946.) 


Government Purchases of Nondefense Items 


The nondefense portion of government purchases—g” =G?/Y, where 
G” is nondefense real purchases—was examined statistically over 
samples beginning in 1929. This study revealed little predictive value 
for first differences, Dg?, except for a negative association with the 
contemporaneous change in the defense component, Dg”.24 In par- 


* Past history of the residuals, lagged values of Dg” or Dg”, a capital stock measure 
Dk’, and a constant were all insignificant. 
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ticular, there is no drift in the nondefense purchases ratio. The 
negative association of Dg? with Dg” reflects the crowding out of 
nondefense government spending during wartime. The dependence 
of Dg? only on Dg” means that departures of g? from the perceived 
normal value g” are determined completely by the difference between 
g” and g”. With g” — g” held fixed, changes in g? amount entirely to 
shifts in the permanent component of nondefense purchases. Ac- 
cordingly, with the g” variables entered separately, the coefficient of 
the g? variable in an output equation would reveal the effects of 
permanent changes in nondefense purchases. It is not possible here to 
estimate the response of output to temporary changes in nondefense 
purchases, since no temporary changes were isolated over the sample. 


Empirical Results 


The principal empirical analysis involves joint estimation of the gov- 
ernment purchases equation (10) and a relation for output that is 
based on the form of equation (4). With the defense and nondefense 
components of government purchases entered separately, the output 
equation becomes 


log(Y;) ee Big” a gs 1 Bog? or Bg? . (11) 


Note that the real government purchases variables all appear as ratios 
to real GNP. The variable g¥ = (G“/Y), is determined as a function of 
observables from equation (10) when used in conjunction with some 
relations that are derived in the Appendix (eqq. [A5] and [A6]). 

The first set of hypothesis tests involves the output effects of the 
(g’ — g”), and g? variables in equation (11)—specifically, that the 
coefficients of these variables satisfy the restrictions, 0 < B, S B, < 1. 
As indicated earlier, the coefficient on the g? variable, 83, reveals the 
output effect of a permanent change in nondefense purchases. If 
nondefense purchases were characterized by closer substitutability 
with private consumption expenditure (the @ parameter) and by lesser 
impact on private production (the MPG parameter) than defense pur- 
chases, then 8, > 83 would follow. However, this condition cannot be 
viewed as a firm implication of the theory. The model also suggests 
the restriction, 0 < £3; < 1. 

The second set of hypothesis tests checks whether the explanatory 
variables for gi’—in this case By4., . . . , By-3 and ki2,—enter an 
unrestricted reduced form for output as determined solely by their 
role in determining g? in accordance with the coefficients of equation 
(10). 

The analysis is contingent on a value of the discount rate p—the 
difference between the real interest rate and the growth rate of real 
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GNP—in the calculation of the gv variable (see eqq. [A5] and [A6] in 
the Appendix). However, the results turn out to be relatively insensi- 
tive to variations in the p parameter at least over the range from .01 to 
.05 per year. The reported results refer to a fixed value of p = .02 per 
year, which is a plausible magnitude, ex ante, and which approxi- 
mates the maximum likelihood estimate for this parameter. 

Jointly estimated equations for real defense purchases and real 
GNP were calculated by means of a nonlinear, maximum likelihood 
routine from the TSP regression package, which includes estimation 
of contemporaneous covariances for the error terms. The estimation 
is joint in the sense of incorporating the role of the coefficients from 
equation (10) in determining the series g? and thereby influencing the 
fit for output in the form of equation (11). In particular, the 
coefficients in the equation for Dg? are not determined solely to 
obtain a best fit of equation (10). I have not carried out joint estima- 
tion in the broader context of choosing the number of leads and lags 
of the B variable to include equation (10), in deciding to omit 
moving-average error terms in this equation, in analyzing the process 
for nondefense purchases, and so on. 

Since the dependent variable, real GNP, appears also in the de- 
nominators of the ratio variables g” and g?, there is a possible simul- 
taneity problem in the estimation. Accordingly, I have used as in- 
struments for g? and g? the lagged values, g¥, and g?_,, and also the 
contemporaneous values, G?/Y, and G?/Y,, where Y, is the trend value 
of real GNP, as determined from a regression over the 1946—78 
period of log(GNP) on a constant and time. The estimates are not 
altered substantially if only the pair of lagged values or only the pair 
of contemporaneous values relative to trend are used as instruments. 
Empirically, the movements in gi’ are dominated sufficiently by con- 
ditions of war or peace that the use of instruments yields estimates for 
the coefficients of the (g” — g”), and g? variables in equation (11) that 
differ only in minor ways from ordinary-least-squares (OLS) values. 
However, the use of instruments is important in the case of the gr 
variable—OLS estimates for the B3 coefficient in equation (11) appear 
to be biased downward substantially because of the inclusion of Y, in 
the denominator of the g? variable. 


Results for Post-World War II Output Sample 


For an output sample that begins in 1946, the results of the joint 
estimation of equations (10) and (11) are, for the 1932-78 sample: 


Dg? — .163DB 45 ae -198DB,,, ate .273DB, ce .240DB,_, 
(013) (.012) (014) -(.017) 
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=".022DB,-5 + .088DBi_3 = .26DkY.,, 


(.015) (.016) (.08) ae 
o = .0143, D-W = 1.7; 
and for the 1946-78 sample: 
log(Y;) = 2.97 + .0343 - t + .83DMR, + 1.12DMR,_, 
(.04) (.0008)  (.22) (22) 
+ .99(g" — g), + 55 pF + .62g?, (13) 
(.21) (.12)  (.45) 


o = .0143, D-W = 1.5. 


Asymptotic standard errors are shown in parentheses below the 
coefficient estimates. The & values are asymptotic estimates of the 
standard errors of the disturbance terms; D-W is the Durbin-Watson 
statistic. Note that the sample for the government purchases equation 
(12) begins in 1932 and thereby includes the World War II experi- 
ence. 

Variables included in equations (12) and (13) are: Y, real GNP 
(1972 base); t, time trend; DUR =DM — DM is “unanticipated money 
growth,” as measured in earlier research (Barro 19816), where DM is 
an estimated value of money growth from an equation that is based on 
the M 1 definition of the money stock;* g” = G“/Y, where G” is real 
defense purchases (1972 base); g? = G®/Y, where G? is real, non- 
defense, federal plus state and local purchases (1972 base); B, casualty 
rate variable as defined in table 1; and k” = K“/Y, where K” is real 
government defense capital stocks (1972 base). 

For present purposes I focus on the role of the government pur- 
chases variables in equation (13). The money shock variables have 
effects that are similar to those discussed in previous research, as 
reported in Barro (19810). 

The g? variable in the output equation is based on the specification 
for Dg? that appears in equation (12). The main result from this 
equation is the strong positive spending effect of wars, as measured by 
the casualty rate variable B. The equation shows a 2-year lead effect of 
the B variable and a lagged effect out to 3 years. (The negative effect 
on Dg? of the DB,_, variable is difficult to interpret.) The conse- 
quences of eliminating the future values of DB from this equation are 


oe DM is determined from an equation that is estimated over the 1941—78 sample: 


DM, = .095 + .49DM,_, + .16DM;-2 + .069FEDV, + .030 - log(U/(1 — U)i-1, 
(.024) (.14) (.12) (.015) (.008) 
where observations from 1941-45 are weighted by .36. FEDY, is real federal spending 


relative to a distributed lag of itself, and U is the unemployment rate in the total labor 
force. See Barro (19816) for a discussion of this type of equation. 
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discussed later. For present purposes the most important aspect of 
war spending is its temporary nature, although precise calculations 
for g? involve the distributed lag pattern of DB effects on Dg? and the 
implications of these responses for the behavior of the capital stock 
ratio k”. Equation (12) shows also the expected negative effect of Dk?., 
on Deg?. 

Using equations (A5), (A6), and (A11) from the Appendix, and the 
value p = .02 for the discount rate, the point estimates of coefficients 
that appear in equation (12) can be shown to imply the formula for g?” 
as follows: 


gr = 011 + 67g? + .15k2, — .06B,, + .02B,-. — .16B,_, 


(14) 
a .18B, 4 2B =F (O7 Bae 


This relation is a particular form of equation (6). Equation (14) shows 
a positive but less than one-to-one effect on gi of gi’, a positive effect 
of k, (for a given value of g/’), and a basically negative effect of the 
casualty rate variables (again given the value of gi”). Values of g? that 
are calculated from equation (14) are shown along with values of gi’ in 
table 2. 

The temporary defense purchases variable, (g” — g”),, has a 
significantly expansionary effect on output. The estimated 
coefficient” in equation (13) is B, = .99, S.E. = .21. The “t-value” 
corresponding to B, = 0 is 4.7. The normal defense purchases vari- 
able, gi’, is also significantly expansionary in this equation—f, = .55, 
S.E. = .12, which implies a t-value of 4.6. The estimated effect for the 
permanent purchases variable is somewhat greater than half that of 
the estimated temporary effect.2” The results permit rejection of two 
extreme hypotheses: first, that only the temporary part of purchases 
affects output (which would require Bo, the estimated coefficient of 
the g? variable in eq. [13], to differ insignificantly from zero), 


** Because of the negative correlation of (g” — ge”), with g?, the B, coefficient picks up 
an additional effect. The extra term involves the difference between the output effects 
of permanent and temporary nondefense purchases. The output coefficient associated 
with temporary nondefense purchases could not be estimated with the available data. 
However, since the regression coefficient of g? on (g” — g”), is on the order of —0.1, it is 
unlikely that the overall modification is important. 

* It has been suggested that the temporary government purchases variable may be 
proxying for the effects of accompanying federal deficits. The analysis in Barro (1979) 
documents the strong positive effect of temporary federal spending, as in wartime, on 
public-debt issue. Some preliminary results in Barro (19800) indicate that lagged “debt 
shocks” have expansionary effects on output that are statistically significant but sub- 
stantially weaker than those of monetary shocks. However, this constructed debt-shock 
variable filters out the normal positive association between temporary government 
spending and the deficit. With these debt shocks held fixed, the actual lagged values of 
public-debt growth have no explanatory value for output. This last finding suggests 
that the strong expansionary influence of temporary defense purchases does not 
involve a proxying for the effect of correlated movements in the federal deficit. 
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and second, that temporary and permanent purchases are of equal 
importance for output. The latter case would correspond to equal 
coefficients (8, = By) for the (¢” — g”), and gi’ variables—that is, to the 
proposition that the coefficient of the g? variable would be zero in an 
equation that held fixed the value of the actual purchases ratio, g/’. 
For convenience, the results from equation (13) can be rewritten in 
this form as 


LOR ieee O0T  41 oy 
ey (24) 


The hypothesis that the coefficient of the gi variable equals zero 
corresponds to a t-value of 1.8, which is significant at the 5 percent 
level for the case of this one-sided test. That is, the null hypothesis of 
equal output effects for temporary and permanent defense purchases 
(B: = Bz in eq. [11)) is rejected in favor of the hypothesis that 
temporary purchases are more expansionary, B, > B».28 

The estimated coefficient on the (g” — g’°); variable in equation (13) 
implies that a temporary change in the level of real defense purchases 
has almost a one-to-one effect on the contemporaneous level of out- 
put. While this finding is consistent with the restriction, 8, < 1, the 
evidence would also be consistent with a moderate multiplier relation- 
ship between temporary government purchases and output. The rel- 
atively high estimated output effect is associated in the theoretical 
model with a small value of the 6-coefficient, a high value of the MPG 
parameter, and a high real-rate-of-return elasticity of aggregate sup- 
ply relative to that of demand. 

The estimated coefficient on the g? variable in equation (13) implies 
that a permanent increase by one unit in real defense purchases leads 
approximately to a one-half unit rise in real GNP. This result accords 
with the restriction 8, < 1—moreover, the estimate is significantly 
below unity in this case. 

The estimated coefficient of g?, the nondefense purchases ratio, is 


*8 I considered discriminating between temporary and permanent defense purchases 
by utilizing a measure of the return on the equities of defense contractors relative to 
that on a market portfolio of New York Stock Exchange (NYSE) stocks. (The relative- 
returns variable was constructed using data on total returns to NYSE stocks from the 
Center for Research in Security Prices of the University of Chicago. A list of defense 
contractors and the size of these contracts for 1969 from the Department of Defense 
was kindly supplied to me by Claire Friedland.) This relative-returns variable has no 
explanatory power when added to a first-difference form of the output equation with 
Dg?, Dg?, and the determinants of Dg/ from the form of eq. (12) included as 
independent variables. Conceptually, it is unclear whether the relative-returns variable 
signifies an increase in war probability and, therefore, that current defense expendi- 
tures are more likely to be temporary, or an increase in the long-run expected quantity 
of defense purchases, which would imply that current defense expenditures are more 
likely to be permanent. Therefore, the sign of the variable is ambiguous on theoretical 


grounds. 
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B; = .62, S.E. = .45, which is positive but imprecisely determined. The 
relatively small amount of independent sampling variation in this 
variable since 1946, in conjunction with the necessity of using instru- 
mental variables, results in the high standard error. (Using OLS 
techniques, the estimated coefficient of the g? variable is negative.) In 
any event the hypothesis that the coefficients of the g? and g?’ variables 
are equal, 8B. = 83, would be accepted from the present evidence. As 
noted earlier, the relative output effects for these two types of perma- 
nent movements in government purchases depend, in the theoretical 
model, on the relative values of the 6 and MPG parameters. Possibly, a 
more precise determination of the g? coefficient would be obtained by 
extending the analysis to the 1930s, during which major changes 
occurred in the nondefense purchases ratio. The main obstacle for 
this extension is the isolation of monetary shocks, which seems to 
require a specification for the pre-World War II monetary regime 
that differs from that used for the post-1941 period (see n. 25 
above).”° 

A combination of equation (11) with an expression for g? in the 
form of equation (14) implies a reduced-form relation for output in 
terms of a constant, a time trend, DMR variables, gi’, g?, the B vari- 
ables, and &,. Unrestricted estimation of this reduced form affords a 
test of the hypothesis that the determinants of g?—specifically, the B 
variables and k2,—affect output only in the manner implied by the 
forms of equations (11) and (14). The test is based on the likelihood 
ratio corresponding to unrestricted and restricted forms of joint esti- 
mation. The value of —2 - log(likelihood ratio) turns out to be 3.2, 
which is below the 5 percent critical value for the x? distribution with 6 
degrees of freedom (the number of coefficient restrictions in this case) 
of 12.6.*° Therefore, the hypothesis that the determinants of g¥ enter 
only in this indirect manner in influencing output is accepted. 


*’ It is worth noting that the point estimates for the government purchases 
coefficients in eq. (13) change little if the output equation is respecified in first- 
difference form (see Plosser and Schwert [1978] for a discussion). The estimated 
equation in this case is, for the 1946-78 sample: 


D log(¥;) = .036 + .83D(DMR,) + 1.01D(DMR,_,) + SID ee BF”); 
) 


(.006) (.29) (.34) (.22 
+ .55Dg? + 0.63Dg?, & = .0176, D-W = 2.5. 
(39) (1.67) 


The jointly estimated equation for Dg’ is similar to that shown in eq. (12). Note that the 
constant in the equation for D log(Y;) corresponds to the time trend for the level 
equation. The main change from the previous specification is the higher standard 
errors for the gi and g? coefficients. The Durbin-Watson statistic of 2.5 suggests 
overdifferencing. 

* If the discount rate p were regarded as a freely estimated parameter, there would 
be only 5 degrees of freedom, which would imply a critical value of 11.1. 
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There would, of course, be many possible output effects of war that 
do not operate through the channels that were specified in the pres- 
ent model. For example, there would be responses to conscription 
and patriotism, and the possibility that war would threaten future 
property rights. Some of these effects would, however, influence 
output in a manner similar to that of the pecuniary intertemporal 
substitution variable that was stressed in the theoretical analysis (see 
Barro [198 1a] for a discussion of wartime influences on financial rates 
of return). In particular, the level of temporarily high demands on 
resources by the government, (g” — rou may proxy satisfactorily for 
the full range of wartime output effects. 

More generally, because the war variables are the prime basis in this 
work for distinguishing temporary from permanent movements in 
government purchases, it would be infeasible to allow unrestricted 
direct wartime output effects and still carry out interesting tests of the 
underlying hypotheses. In any event the restriction that the war 
variables influence output only indirectly through influences on tem- 
porary government purchases is satisfied in the present case. 

Elimination of future values of the casualty rate variable from the 
government purchases equation (12) has a substantial effect on the 
estimation of this equation for the 1932—78 sample. Aside from a 
major deterioration in fit, the residuals then show pronounced posi- 
tive serial correlation. These effects are dominated by the World War 
II years—in particular, from the rise in military spending in 1940-41 
prior to the onset of casualties and from the major advance in spend- 
ing in 1942-43 before the peak in casualties for 1944—45. In these 
cases it seems reasonable to treat the future casualty values as rough 
proxies for contemporaneously available information about the inten- 
sity of the war. (Another possibility would be to use a foreign casualty 
rate variable, but data limitations and conceptual problems concern- 
ing the perceived threat attached to foreign conflicts have prevented 
the implementation of this idea.) 

For the post-World War II period, this type of advance informa- 
tion on war intensity seems less important and, in fact, the future 
casualty variables lack significant explanatory power for defense ex- 
penditures over this period. With these future values deleted and 
1946-78 samples used throughout, the results of the joint estimation 
are, for the 1946-78 sample: 


Dg = .264DB, aie .226DB,_, a .O27DBi-» = .O79DB 5 
(061) —(.043) (021) (.022) 


— 30DKE,,  ¢& = .0128, D-W = 1.9; 
Gun) 


(15) 
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and 


log(Y;) = 2.99 + .0340 -t + .88DMR, + 1.12DMR,_, 


(.06) (.0010)  (.25) (.26) 
+ .96(g” — 2”), + 52g? + .74g?, (16) 
(.26) C7 eae 2) 


o = .0177, D-W = 1.5. 


Although the fit of the estimated output equation (16) is poorer than 
that shown in equation (13), the pattern of estimated coefficients and 
standard errors is similar. Therefore, the conclusions on output ef- 
fects of the government purchases variables are not sensitive to, first, 
elimination of the World War II years from the sample for the govy- 
ernment purchases equation and, second, removal of the future val- 
ues of the casualty rate variable from this equation. The results also 
remain similar if the starting date for the output equation is. shifted 
from 1946 to 1950, which removes some contribution of the World 
War II years that works through the effects of lagged B values on the 
constructed g? variable. 


Addition of World War II Output Experience 


Rather than insulating the results from World War II, it is in many 
respects more informative to evaluate the performance of the model 
during this extreme experience. Clearly, the sample variation in the 
wartime-related variable, (g” = g®),, is raised enormously by this 
extension of coverage. On the other side, the inclusion of the World 
War II years raises problems that concern the measurement of real 
output during a period of extensive price controls and the accuracy of 
linear specifications for extreme observations. 

Jointly estimated equations that include the 1942—45 observations 
on output are, for the 1932-78 sample: 


Det — OSD Bea. == .196DB,., +P -274DB, a -242DB,_, 


(.013) (.013) (015) — (.017) 
— .022DB,» + .087DB,-, — .25Dki,, 1) 
(.015) (.016) (.08) 
od = .0143, D-W = 1.7; 
and for the 1942-78 sample: 
log(¥:) = 2.97 + .0350 - 1 +.76DMR, + IO0DMR,_, (18) 


(.05) (.0009) (.15) G5) 
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ee Pt Aer pe 
(.06) (12) G51) 


o = .0154, D-W = 1.4. 


A test that the 1942-45 observations for output conform to the 
same structure as that for the other years corresponds to a value for 
—2 - log(likelihood ratio) of 37.9, which exceeds the 5 percent critical 
value of the x? distribution with 4 degrees of freedom of 9.5.3! It 
seems likely that this appearance of structural break during World 
War II would not appear if the functional form were altered to allow 
for a nonlinear dependence of log(Y,) on (e2 =o"), Specifically, the 
most important change from the 1946-78 estimates in equation (13) 
to the 1942—78 values in equation (18) seems to be the drop in the 
estimated coefficient of the (g” — g”), variable, B,, from .99, S.E. = 
213 10,.11,.5-6. =1.06. Atunctional representation that allowed for 
positive, but diminishing, output effects of temporary government 
purchases would probably account for the overall results in a 
homogeneous form, but I have not experimented along these lines. 
As the results in equation (18) stand, they reveal the anticipated 
reduction in the standard error of the (g” — g”), coefficient—in fact, 
if this relation were viewed as well specified, the estimated output 
effect of temporary government purchases, Bi, would now be mea- 
sured as significantly less than one.” 


*' I have also carried out the estimation with an allowance for heteroscedasticity in the 
form of a different error variance for the output equation during the World War II 
years, 1942-45. Maximum likelihood estimates indicate that those years have an error 
variance that is 2.6 times that for the 1946—78 period, which corresponds to multiplying 
the 1942-45 observations by .62 in the estimation. The results with the heteroscedastic- 
ity correction applied are, for the 1942—78 sample: 


log(Y,) = 2.98 + .0347 - t + .85DMR, + 1.01DMR,_, 


(.04) (.0009) (.17) (.17) 
+ .75(g” — g"), + 48g + 34g”, & = 0148, D-W = 1.4. 
(.07) (12) 453) 


Note that the o value applies here to the error term for the 1946-78 period. The 
pattern of results does not differ greatly from that shown in eq. (18). The jointly 
estimated government purchases equation is very close to that shown in eq. (17). A test 
for the addition of the 1942-45 years for output to the rest of the sample corresponds 
here to a value for —2 - log(likelihood ratio) of 30.9, as compared to a 5 percent x? 
value with 3 degrees of freedom of 7.8. (The degrees of freedom are reduced by 1 here 
in comparison with the test for the context of homoscedasticity because of the estima- 
tion of the heteroscedasticity parameter.) 

* The estimated coefficient for nondefense purchases, g?, is also smaller than before, 
but still insignificantly different from that on the gi?’ variable. With gi’ held fixed, the 
implied estimated coefficient on g? is now —.29, S.E. = .12. That is, the hypothesis that 
temporary and permanent defense purchases have equal output effects—B, = B,— 
corresponds here to at-value of 2.4. This hypothesis would, therefore, again be rejected 
in favor of 6; > B2. A test that the determinants of g? from eq. (17) enter only in this 
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III. Conclusions 


The empirical part of this study documents the positive output effect 
of defense purchases. There is evidence that temporary movements 
in defense purchases, which are associated primarily with wartime, 
produce roughly double the response in output as that generated by 
equal-sized, but permanent, shifts in defense purchases. In all cases 
the results are consistent with a dampened, rather than a multiplica- 
tive, response of output. The effects of nondefense purchases are 
imprecisely determined. 

The theoretical section stresses intertemporal substitution variables 
as the channel for the strong positive output effect of temporary shifts 
in government purchases. Preliminary empirical analysis of realized 
real rates of return (Barro 198la) provides some support for this 
mechanism, but further joint consideration of output and real-rate- 
of-return behavior would constitute useful research. 


Appendix 


Derivation of the Permanent Defense Purchases Ratio, g” 


Equation (10) can be written in the form, 
Dg? = A(L)DB,; — yDRY, + &, (Al) 


where A(L) is a polynomial in the lag operator L, which allows both lags and 


leads of DB, to affect Dgi’. Using equation (8), the evolution of k” is governed 
by 


bY = bg? + (1 ~ 8A, a 


where (1 — 8’) = (1 — 6)(1 — A) and J is the (assumed constant) growth rate of 
real GNP. 

Equations (Al) and (A2), together with a specification for the stochastic 
structure of the B variable, imply a distribution for future values of e. 
conditional on information available at date t (which is assumed in the present 
case to include the values of B,,,; and B;4»). Equation (A1) implies that future 
values of the spending-GNP ratio are given by 


Sei = gt + A(L)(Bisi — By) — y(Ra-1 — Ay) + error term. (A3) 


Equation (A2) can be used repeatedly to eliminate future values of k” from 
equation (A3), which leads eventually to the condition. 


indirect manner in influencing output leads to a value for —2 - log(likelihood ratio) of 
17.4, as compared to a 5 percent critical value for the x? distribution with 6 degrees of 
freedom of 12.6. With a heteroscedasticity correction applied for 1942-45 (see n. 31 
above), the corresponding statistic is 10.3. Therefore, with the World War II obser- 
vations included in the output sample, there is greater indication of an output effect of 
war that operates directly and not only through the (g” — g”), variable. However, this 
conclusion could also be affected by inappropriate linear specification of eq. (18). 
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ree gro ey l= 6! = by) 
Bis = Bi 6’ + by 


+ bya, | A= OO Py) 4 4B. ~ B.) (A4) 


— by[A(L) Bri + (1 — 8’ — by)A(L)Bisi-2 
Toe (LO by) AE) Ble errorterm. 


The bracketed expression multiplying gi is positive but less than one, as 
indicated in the text. The coefficient on k“, is positive—that is, future values 
of gi,; rise with k/{, for a given value of gi’—because ki", exerted a depressing 
effect on gi that should be filtered out in determining the “permanent 
component” of g/. The term 4 (L)(B,,; — B;) measures the temporary effect of 
the B variable on g;‘,; relative to that on gi (which is filtered out as above in 
obtaining the permanent component of gi"). The final bracketed term ac- 
counts for the interaction between B-induced temporary spending and the 
resulting negative effect of the implied accumulation of k” on subsequent 
spending. 
The variable of interest for output determination is 


a = ( p ler + > Bgl + p)') 
i=1 


i Se 





where E is the expectation operator. The discount rate is p = r — A, where A is 
again the growth rate of real GNP. The variable g/ can be determined from 
summation over 7 in equation (A4) to be 


Gere pro" J + he ae ae 
gi Elaceea airy No + 8! + by 


Sitter pcweoy | 
1+p/\ pt 6' + by 


(A5) 


where ®, = 27, E[A(L)(Bi4; — By)]/(1 + p)'. Again, the effect of gi is positive 
but less than unitary, and the effect of k/“, is also positive. 

For the case where A(L) = ay + aL + a2l? + asl? + a,(1/L) + a,(1/L)*, and 
where observations on the B variable through B,4, are available at date ¢, the 
®, expression in the last term of equation (A5) can be written as 


®, = B,-3(—a3/p) + By-2[—a2/p + a3/(1 + p)] + Bi-,[—a,/p + a,/(1 + p) 

FM 1 + p)l + BA agp + a,/(1 + p) +. a./(l + p)’ + az/(l + p)*) 

+ By,[—a,/p + ao/(1 + p) + a, /(1 + p)? + ae/(1 + p)® + ag/(1 + p)*] 

A Biol asp tials p) + asf p)* + a, /Cl + py? + aa/(l+ p)* 

+ aj/(1 + p)’) + WV, la, + a,/(1 + p) + ao/(1 + p)? 

Bagh + py + al + py + asi/(1+ py, (A6) 
where VY, = 2, EBj+i+2/(1 + p)' is a variable that measures the effect on 
expected future spending of anticipated future wars (in the present case from 
year ¢ + 3 onward). 

The variable gi in equation (A5) is now related to various parameters and 
the variables gi’, ki ,, Br-3, .. - , Brsz, and W,. The remaining work is to relate 
expectations of future values of B, as entering through the Y, variable, to 
currently observed variables, including values of B up to By4». 
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Expectations of Future Wars 


Calculation of expected future casualty rates is based on the following station- 
ary probability model for wars.* First, a 2 X 2 matrix is specified for the 
probability of war or peace next year (or rather for year ¢ + 3 when conditions 
at ¢ + 2 are assumed known at date ¢), conditional on war or peace prevailing 
currently. It is assumed that information about the future course of B is 
contained fully in the most recent observation, earlier values of B and values 
of other variables not having to be considered. The probability of war during 
at least part of next year, given peace for the latest observation, is based on 
data over the 1774-1978 period—namely, 


p, = Prob(B,,, > 0| B, = 0) = 9/162 = .06, (A7) 


where 162 is the total number of peacetime years in the sample (where B; = 
0), and 9 is the number of these years that were followed by the outbreak of 
war.** Correspondingly, the probability of peace continuing is given by 


(ip) = Probihay = 0|B, = 0) = .94. 


The value of p, is slightly higher if the sample is limited to the more recent 
period 1889-1978 (the sample for which relatively accurate observations on 
G" and B are available), for which the result is p, = 5/68 = .07. 

The probability of the continuation of war is given for the 1774-1978 
sample by 


Po = Prob(By4, > 0|B, > 0) = 33/42 = .79, (A8) 


where 42 is the number of war years (where B, 4 0), and 33 is the number of 
these that were followed by another year of war.* In other words nine wars 
began and ended over the sample 1774-1978. For the 1889-1978 period, the 
result would be p, = 16/21 = .76. Finally, the probability of no war next year, 
given its existence this year,*® is given for the 1774-1978 sample by 


(1 — p2) = Prob(Bi41 = 0 |B, 2 ())i oe 


The expected value of B for the first year of a war is calculated as the mean 
value for the five wars since 1889 (for which accurate data on B are available): 


*8 War probabilities and the distribution of sizes of wars need not be constant over 
time, although there is no indication of substantial structural change in the small 
sample of evidence afforded by the 200 years of U.S. history. (The largest value for the 
B variable would actually apply to the Civil War—see the notes to table 1.) From the 
standpoint of constructing the g/ variable, shifts in the stochastic structure for wars 
would essentially be an alternative to the present specification that allows for shifts in 
spending for a given war structure, as represented by the stochastic variable e, in eq. 
(A1). In the context of output analysis, it is unclear that there would be much empirical 
difference between these alternatives. 

*4 The year 1978 is not included in this calculation, although it could have been if 
peace during 1979 were also included. War years are taken to be 1775-83, 1812-15, 
1846-48, 1861-65, 1898, 1917-18, 1941-45, 1950-53, and 1964-72. There may be 
some objection to starting the sample just before a war (which is not independent of the 
start of U.S. data), but the results are not highly sensitive to this choice. 

*» The probability p, refers to the existence of war during at least part of a year 
following a period of war during at least part of the previous year. 

*6 This calculation pertains to the existence of peace over the entire year ¢ + 1, 
conditional on war during at least part of year t. 
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TABLE Al 
PARAMETER VALUES FOR EXPECTED CASUALTY RATES 
Pp Mo Bi 
.O1 1.95 Oe 
.02 .94 220 
.05 35 2.06 
.10 .16 1.80 
25 051 1.30 
c= E (Biss | Bess = 0, B7— 0) 
(A9) 


=(.005 +23 + 004 +071 + .001) = 062, 
Since war could break out at any time during the year, the annualized value of 
EB,,,, denoted by B4, would be roughly twice the above figure—that is, B4 ~ 
Ae? 

Finally, when B,,, and B, are both positive, the conditional expectation for 
B, is given by 
E (Bess | Bids > 0,B,> 0) = 6) + 0,Bé, 


where B# is the current casualty rate expressed at an annual rate if hostilities 
applied only to a fraction of year ¢t. The parameter 6, is based on the 
assumption (not refuted by the small sample of U.S. data) that wars tend 
neither to grow nor to contract over time, except that war may end at some 
time during year ¢ + 1 as governed by the parameter p,. Accordingly, 6, ~ 1 
—Ya(1 — pz) = .90. The parameter 6, is set so that the value of the V; variable, 
which appears in equation (A6), converges to the value associated with By. = 
0 as B,42 — 0 (which essentially recognizes that a new war may break out next 
year, even if one is already going on). The value of 4) turns out to be 6) ~ p,B4 
= .007. Accordingly, I use the relation 


E(B iyi |Bizs > 0,B,> 0) = .007 + .90B4. (A10) 


Equations (A7)—(A10) allow calculation of the relevant expectation of fu- 
ture casualty rates, which appears in equation (A6), V, = 272; EBy+i42/(1 + p)', 
conditional on observation of B through B,4, and for a given value of the 
discount rate p. The result takes the form 


We = Mo + Mi Bere, (A11) 
where py and pw, can be determined as functions of the p parameter.*” 
Specifically, these coefficients for selected values of p can be seen in table Al. 


Since p corresponds to the difference between the real rate of return and the 
real growth rate, the values of the u coefficients corresponding to the lower 


37 The general formulae are: 


Mo = [Bp,(1 — po) + p + p26) 
+ poOo(1 + p)(p + pip + p + pr — p21 + p — poAr)I, 


My = p26,/(1 apf) — p26;). 
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values of p would seem to be most pertinent. The empirical results that are 
reported in the text use the value p = .02 per year. 

The combination of equation (A11) with equations (A5) and (A6) allows 
calculation of the perceived permanent government purchases ratio, g7’, as a 
function of the variables (g?, k#1, By_-3, . . . ,By42) and the parameters (p, y, ao, 
1, Qo, M3, 44, 22), where p is the net real discount rate, y measures the reaction 
of current defense purchases to existing capital stock, and the a’s and a’s 
describe the effect of the array of B variables on defense purchases. The 
results are therefore expressed in terms of the general form of equation (6) in 
the text. Other coefficients that appear in the analysis (8’, 5, p1, p2, B—see, 
e.g., the expressions contained in n. 37 above) are treated as fixed at the 
values: 5’ = .16 per year, b = .34, p; = .06, p2 = .79, and B = .062. 
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Information Costs, Duration of Search, and 
Turnover: Theory and Applications 





Louls L. Wilde 


California Institute of Technology 


This paper uses a formal model of search over multiattribute alter- 
natives, analyzed in a product market setting, to investigate the 
theoretical foundations of the empirical literature on duration of 
search and turnover in product markets, labor markets, and mar- 
riage markets. A number of specific empirical predictions are also 
derived. In particular, whether “quality” is a “search” attribute or an 
“experience” attribute is related to the cost of search, the cost of 
inspection, the price of the good, and certain properties of the 
market distribution of price and quality. 


I. Introduction 


Several years ago Phillip Nelson observed that certain problems arise 
in extending search theory to deal with nonhomogeneous goods. In 
particular, he noted that “information about quality differs from 
information about price because the former is usually more expensive 
to buy than the latter” (1970, p. 311). To analyze the implications of 
this observation, Nelson divided goods into two classes, search goods 
and experience goods. Search goods are those for which utility is 
assessed before purchase by actual inspection. Experience goods are 
those for which utility is assessed after purchase by actual consump- 
tion. 


This paper is a substantially revised version of “Further Results on Inspection and 
Evaluation in Product Markets” (unpublished manuscript, California Institute of 
Technology, February 1978). I would like to thank Steve Lippman and Alan Schwartz 
for very helpful comments on an earlier draft. Comparative statics results are proved 
in an Appendix which is available from the author (California Institute of Technology, 
Social Science Working Paper no. 306, January 1980). 


[Journal of Political Economy, 1981, vol. 89, no. 6] 
© 1981 by The University of Chicago. 0022-3808/81/8906-0012$01.50 


1122 


INFORMATION COSTS 1123 


These definitions turn out to be very strong. So strong, in fact, that 
they lead to some difficulties. Consider, for example, experience 
goods. For these goods utility is assessed after purchase by actual 
consumption. But at least the price is observed before purchase. Since 
for experience goods this cannot, by definition, affect the choice of 
which brand to buy, Nelson was forced to assume that “consumers 
either sample at random from among all brands or from among those 
brands in the price range the consumer deems appropriate for him- 
self” (1970, p. 313). These assumptions require that consumers either 
ignore prices completely or have perfect information regarding 
prices, neither of which seems likely. 

The problem with Nelson’s definitions is that goods generally pos- 
sess a number of characteristics which can differ in their degree of 
observability. Thus a good might possess some “search characteristics” 
and some “experience characteristics.” Furthermore, whether a par- 
ticular characteristic is a search characteristic or an experience char- 
acteristic ought to be endogenously determined by the consumer. 

In previous work, I have analyzed a model in which goods are 
described by precisely two characteristics, price and quality. The mar- 
ket is described by an exogenous distribution of these characteristics. 
Sampling from this distribution is assumed to be costly. Once an 
observation is drawn, price is observed costlessly. Quality, however, 
can never be observed before purchase. In this case quality is an 
extreme example of an experience characteristic, one for which the 
cost of observation prior to purchase is infinite. The present paper 
extends this model to allow quality to be observed at some finite cost. 
The purpose is to explicate the relationships between information 
costs, duration of search, and turnover.' 

This model, introduced in Wilde (1980), applies to a number of 
cases discussed by Nelson. For example, it formalizes his prototypic 
experience good, canned tuna fish. Nelson suggested that “to evaluate 
brands of canned tuna fish, for example, the consumer would almost 
certainly purchase brands of tuna fish for consumption. He could, 
then, determine from several purchases which brand he preferred. 
For tuna fish there is no effective search alternative open. At the low 
price of experience, there is insufficient demand for specialized es- 
tablishments selling tastes of various brands of tuna fish” (1970, p. 
312). In this case, Nelson seems to be suggesting that the “price of 
experience” is low because the price of the good is low. However, the 


1 The formal analysis in this paper will be set in the product market. In this case 
turnover is a nonrepeat sale. Sec. IV will discuss applications to the labor market and 
the marriage market. In the labor market, turnover is a quit and in the marriage market 
turnover is a divorce. 
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price of experience can be low for other reasons as well. For example, 
if there is little variance in quality, then the price of experience is low 
because there is little chance of purchasing a low-quality good. In fact, 
it will turn out that by allowing quality to be observed before purchase 
an explicit expression for the price of experience can be derived. 
Uulizing this expression, this paper will also explicate the relationship 
between information costs and the price of experience. 

The focus of the model developed in Section II is on an imperfectly 
informed consumer who is interested in maintaining a flow of con- 
sumption of a good which is described by price and quality. The 
market offers various combinations of price and quality, but the 
consumer cannot costlessly observe them; by paying a search cost he 
or she can sample a good from the market, but only price is observed. 
Quality can be observed either before purchase by actual inspection 
(at some additional cost) or after purchase by actual consumption. 
Whether quality is observed before purchase (herein called inspec- 
tion) or after purchase (herein called evaluation), the consumer can 
return to the market and resample if the observed quality is too low. 
The initial problem is to characterize the optimal strategy for a con- 
sumer in such an environment. For a fixed utility function, joint 
distribution of price and quality, and cost of search, three possibilities 
arise depending on the cost of inspection: (1) If the cost of inspection 
is low enough, inspection will be the optimal strategy for low prices 
and drawing a new observation will be the optimal strategy for high 
prices. (2) If the cost of inspection is of an intermediate amount, 
evaluation will be the optimal strategy for low prices, inspection will 
be the optimal strategy for intermediate prices, and drawing a new 
observation will be the optimal strategy for high prices. (3) If the cost 
of inspection is high enough, evaluation will be the optimal strategy 
for low prices and drawing a new observation will be the optimal 
strategy for high prices. 

The formal results of the paper relate to the characterization of the 
consumer's optimal strategy which is summarized by statements 1-3 
above. However, these results have broad implications. Three im- 
portant general observations emerge from the analysis. 

First, as mentioned above, the price of experience can be defined 
analytically. This is important because it allows one to differentiate 
between the direct, short-run benefits of purchasing a good which are 
derived from its consumption and the indirect, long-run benefits of 
purchasing a good which are derived from evaluation of its quality 
attributes. 

Second, it will be demonstrated that in some cases quality will be a 
pure search characteristic (statement 1 above), in some cases it will be 
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neither a pure search characteristic nor a pure experience character- 
istic (statement 2 above), and in some cases it will be a pure experience 
characteristic (statement 3 above). Hence, not only is determining 
whether a particular good is an experience good or a search good a 
complex matter, but so is determining whether a particular charac- 
teristic of that good is an experience characteristic or a search char- 
acteristic. Since these distinctions have become very popular in the 
literature it is important to understand their limitations. 

Finally, the comparative statics associated with the characterization 
of the consumer’s optimal strategy will show that the theoretical 
foundations of much of the empirical work on duration of search and 
turnover in labor markets and marriage markets are unsound. 

A number of specific empirical predictions are also derived. It is 
shown that quality is more likely to be a search attribute (1) the lower 
is the cost of search, (2) the lower is the cost of inspection, (3) the 
higher is the price of the good, and (4) the larger is the variation in 
utility due to quality relative to the variation in utility due to price. 
Similarly, it is shown that quality is more likely to be an experience 
attribute (1) the higher is the cost of search, (2) the higher is the cost 
of inspection, (3) the lower is the price of the good, and (4) the smaller 
is the variation in utility due to quality relative to the variation in 
utility due to price. 

It is also shown that regardless of whether quality is a search 
attribute or an experience attribute, increases in search costs decrease 
the duration of search. In general, increases in inspection costs have 
ambiguous effects on the duration of search, but it is argued that if 
the variation in utility due to quality relative to the variation in utility 
due to price is low, increases in inspection costs are likely to increase 
the duration of search (at least by one measure). 

Finally, considerable attention is focused on turnover. Again, in the 
most general setting results tend to be ambiguous. This is especially 
true regarding the relationship between search costs and turnover. 
However, it is argued that if the variation in utility due to quality 
relative to the variation in utility due to price is low, increases in 
inspection costs are likely to increase turnover. 

This paper is organized as follows. Section II introduces the basic 
model. Section III considers the three cases mentioned above: (a) in 
which quality is always observed before purchase; (b) in which quality 
is observed before purchase for some prices but is observed after 
purchase for other prices; and (c) in which quality is always observed 
after purchase. Section IV discusses the empirical implications of the 
model in the labor market and the marriage market. Section V pro- 
vides a brief conclusion. 


1126 JOURNAL OF POLITICAL ECONOMY 
II. The Model: Notation, Definitions, and Assumptions 


In this section the basic model will be developed and an analytical 
expression for the price of experience derived. Section III provides 
comparative statics for the three cases mentioned in the introduction. 

Assume the good which is sought by the consumer has a lifetime of 
one period. Let U(p, q) be the total net value to the consumer of 
purchasing and consuming the good characterized by price p and 
quality g, where U is differentiable and bounded on R, X Ry with 
aU/dp < 0 and 0U/dq > 0. Let h(p, q) be the consumer’s subjective 
estimate of the market density of P and Q. For mathematical conve- 
nience, assume @ is strictly positive on Ry X R,. Define f(p) as the 
marginal density of P and g(q | p) as the conditional density of Q given 
P = p, both based on ¢. 

The cost of drawing an observation at random from ¢ is cs, where 
cs = 0. The cost of observing the true value of Q prior to purchase is 
Cr, where cr = 0 (both cs and cz are measured in the same units as U). 

The consumer can sample as many observations as desired from @ 
at the beginning of each period. Any number of inspections are also 
allowed. However, the consumer demands precisely one unit of the 
good each period.” The consumer’s objective is to maximize his or her 
expected discounted utility of consumption net of search costs. Sam- 
pling is assumed to be without recall, the horizon is infinite, and the 
discount rate is 8, where 0 < B < 1. 

Now suppose the consumer has sampled a good priced at p. Three 
reactions are possible: sample again without observing quality; inspect 
quality and then either buy the good forever or sample again; or 
evaluate quality and then either buy the good forever or sample 
again. Let v(p) be the expected value of drawing an observation of p 
and then proceeding optimally. Then 


u(p) = as + max {V,B(p), T(p)}, (1) 


where V is the expected value of search (i.e., the expected value of 
v[p] taken with respect tof), B(p) is the expected value of buying the 
good without observing quality and then proceeding optimally, and 
T(p) is the expected value of testing quality prior to purchasing the 
good and then proceeding optimally. 

To define B(p) and T(p) analytically, observe first that, once quality 
is known, the value of the optimal policy is the same whether quality is 
observed via inspection or evaluation. In other words, once quality 
has been observed, it is irrelevant how it was observed—any asso- 


* Search and inspection are assumed to be timeless in order to avoid confounding the 
direct costs of these activities with the opportunity cost of delaying the purchase 
decision. 
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ciated costs will by then be sunk costs. Define the value of the optimal 
policy once quality is known as k(p). Then 


k(p) = VGlq*(p)| p] + Fy EP etal praa, (2) 
where qg*(p) is that quality level which makes the consumer indifferent 
between consuming the good characterized by [p, q*(p)| and search- 
ing again for a new good from @. That is, q*(p) is defined by U[p, 
q*(p)| = Vi — B). The logic of (2) is that if g < q*(p), then the 
consumer rejects the good and samples again, receiving V (the ex- 
pected value of search). This happens with probability G[q*(p) |p. If 
q = q*(p), then the good is acceptable and the consumer receives the 
conditional expected value of U(p, q)/(1 — B), given q = q*(p). 

Using k(p), both B(p) and T(p) are defined straightforwardly. If the 
consumer buys the good priced at p, he or she receives the expected 
utility of consumption, given quality is unknown, plus the discounted 
value of an optimal policy once quality is known. Hence 


B(p) = EU(p, Q) + Bk(p). (3) 


If the consumer tests for quality prior to purchase, he or she pays cp 
and receives the undiscounted value of an optimal policy once quality 
is known. Hence 


T(p) = —¢y + k(p). (4) 


From (3), by adding and subtracting k(p) and rearranging, B(p) can 
be rewritten as B(p) = —[(1 — B)k(p) — EU(p, Q)] + k(p). The term in 
brackets is a cost which is directly comparable to ¢7. That is, one can 
define 


ca(p) = (1 — B)R(p) — EU(p, Q). (5) 
Then B(p) = —cp(p) + k(p). 


Equation (5) provides an analytical expression for Nelson’s price of 
experience. Furthermore, it has a natural interpretation. Recall that 
k(p) is the expected value of an optimal policy once quality is known, 
given the observed price is p. When quality is observed via actual 
consumption, this value is not obtained for one period (since in this 
model the good lasts for precisely one period). Hence (1 — B)k(p) is 
the gross opportunity cost of consuming the good, given quality is 
unknown. But consumption of the good yields utility, in this case 
EU(p, Q). The net opportunity cost of consuming the good, given 
quality is unknown, is the difference between these two quantities. 

In analyzing the optimal policy it will be convenient to make a 
transformation of variables in the definition of k(p). Since it is ulti- 
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mately final utility which matters to the consumer, the focus of (2) can 
be shifted from the conditional distribution of quality given price to 
the conditional distribution of utility given price. That is, let V(w | p) 
be the conditional distribution of utility given P = p. Then k(p) 
becomes 

zp) 


Mp=vviva—plpl+] 2 ewwlpdw, © 


where z(p) = lim,..U(p, q). Integrating (6) by parts gives 





2(p) 


» Ew p)du (7) 


Ma 


ip) =~ py [zpy — J 


This form of k(p) is intuitively less appealing than (6), but it is more 
powerful. For example, using (5) it is easy to show from (7) that 
VB) 
cp(p) =i Vw |p)dw. (8) 

where z(p) = U(p, 0). Expressing the price of experience in this form 
is useful analytically since it makes the comparison between B (p) and 
T(p) easier. Conceptually, it helps identify factors which might affect 
the decision whether to observe quality before purchase or after 
purchase. For example, suppose the cost of search, cs, increases. Then 
surely the value of an optimal policy will fall, that is, @V/Acs < 0. 
Equation (8) suggests that the price of experience will then fall as well. 
Hence the higher is the cost of search, the more likely it is that 
evaluation will be optimal. These and other results will be formalized 
in the following section. First, however, a few more preliminary as- 
sumptions will be needed. 

Using the definition of cg(p) introduced above in equation (5), the 
functional equation (1) can be rewritten as 


u(p) =—c¢s + max {V, k(p) — ce(p), k(p) — er}3 (9) 


The next step in characterizing an optimal policy is to compare V, k(p) 
ach Pp) anGih (Dye ic7. Unfortunately, without more structure on the 
joint distribution of price and quality, any number of things can 
happen. Nelson recognized this problem as well, stating the following: 
“Prior to using [a] brand, all the consumer knows is its price. But this 
knowledge provides only the roughest sort of guide to choice, for the 
consumer must assume a generally positive relationship between price 


* It is convenient at this point to assume that the functional equation (1) has a unique, 
bounded solution. In this case V is unique and well defined. Existence of a bounded 
solution to (1) is straightforward. Uniqueness can be established along traditional lines 
if search and inspection are not assumed to be timeless (see Wilde 1979) or by appealing 
to the appropriate generalization of MacQueen and Miller (1960). 
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and quality. In the absence of any other information, the consumer 
would not know if he were better off experimenting with low- or 
high-priced brands” (1970, p. 313). To get around this problem, 
Nelson converts the joint distribution of price and quality to a dis- 
tribution of net utility and proceeds under the assumption that evalu- 
ation is always used to observe quality. In the present analysis, since 
the decision whether to observe quality before purchase or after 
purchase is endogenous, a more formal approach is needed. The 
standard assumption is that 0V(w | p)/Op > 0. This assumption im- 
plies that, on average, higher price is associated with lower utility even 
though higher price may well be associated with higher quality. Be- 
cause this assumption is discussed at length in Wilde (1980), it will be 
assumed to hold here without further rationalization.‘ 

Several implications follow directly from the assumption that 
AV(w|p)/dap > 0. First, it implies that the price of experience is 
increasing in the price of the good. To see this, simply take the 
derivative of (8) with respect to p: 


V(1i-B) 
CRP) =| [OV (w | p)/apldw — V[z(p)| p]z’(p) 


2(p) 


V(1—-B) 

= fe [AW (w| p)/opldw, 
z(p 

since V[z(p) | p| = 0 by definition. Second, 0V(w | p)/dp > 0 implies 

that observing quality before purchase becomes a less desirable alter- 

native to sampling again as the observed price increases. That is, using 


(6), 
2p) 
T'(p)=k(~)=—-[” [aww] pylepldw <0. (10) 


Moreover, B'(p) = k'(p) — cg(p) < T'(p) since cz(p) = 0. Hence 
observing quality after purchase also becomes a less desirable alterna- 
tive to sampling again as the observed price increases, and it does so at 
an even faster rate than observing quality before purchase. 


III. The Optimal Policy 


It turns out that three qualitatively distinct forms of the optimal policy 
are possible. If cr is low enough, then inspection always dominates 
evaluation. If cr is somewhat higher, then evaluation dominates in- 


* Even in the case where cz is infinite, some formal structure must be placed on WV. See 
Wilde (1980) or Hey and McKenna (1981) for more details. 
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spection for one set of prices and inspection dominates evaluation for 
another set of prices; and, if cp is high enough, then evaluation always 
dominates inspection. These three cases are analyzed next. 


A. Inspection Only 


In comparing the expected value of observing quality before pur- 
chase to the expected value of observing quality after purchase, the 
crucial parameters are cr and c,z(p), the respective costs of these two 
activities. It is clear from (3) and (4) that B(p) 2 T(p) as cp 2 cp(p). But 
Cz(p) is increasing in p so that c,(0) = cp implies cg(p) = cr for all p = 0. 
Hence the expected value of observing quality before purchase will 
always be greater than the expected value of observing quality after 
purchase when ¢,(0) > cr. This is obviously most likely to be the case 
when ¢,(0) is large and cr is small. Equation (8) suggests that c;(0) is 
most likely to be large when V(1 — £) is significantly greater than z(0). 
But V(1 — g) = U[0, q*(0)] and z(0) = U(0, 0). Hence c,(0) is most 
likely to be large when q*(0) is high. In other words, inspection is 
likely to dominate evaluation for all prices when the cost of inspection 
is low or when few quality levels are acceptable even at low prices. The 
latter is the case, for example, when the cost of drawing observations 
from ¢ is low or the variation in utility due to quality is high relative to 
the variation in utility due to price.® 

In the remainder of this subsection it will be assumed that ca(0) = 
Cr; that is, it will be assumed that 


V(1-—B) 
ih V(w | 0)dw = cr. 


2(0) 


This implies that B(p) < T(p) for all p 2 0, in which case B(p) can be 
ignored completely; characterizing the optimal policy reduces to 
comparing T(p) and V. Two possibilities arise. In the first there exists 
a unique finite price, say p*, such that observing quality prior to 
purchase is optimal for p < p* and sampling again is optimal for p = 
pz (see fig. 1). The critical price is defined by V = T(p*).® In the 
second, V is strictly less than T(p) for all p = 0 so that observing quality 
prior to purchase is always optimal.” 


° There is no guarantee that ¢a(p) > 0. In particular, it might be the case that Cp(0) = 
0. It can be shown, however, that there exists « > 0 such that GAO) = Oil @ Ke 

6 Since OW (w | p)/Op > 0 only implies T’(p) < 0, it is possible that the equation T(p)= 
V does not have a unique solution. However, if this is the case, then {p|T(p)=V}= 
[pr, ©), where pr = inf {p | T(p) = V}. If pr is defined in this fashion, it satisfies the 
formal requirements stated in the text. 

‘Formal proofs of these assertions have been omitted since they are trivial. It is 
apparent, however, that since c,(0) > 0, there is always a small enough value of cr 
(possibly 0) such that B(p) < T(p) for all p = 0. 
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T(p) 





Fic. 1.—Definition of p* 


Assume that p7 exists and is finite. Then p7 and q* partitionR, X Ry 
into three sets (see fig. 2). In region I, p > px, so the good is rejected 
outright. In region II, p < p* but q < q*(p), so that quality is observed 
prior to purchase but the good is subsequently rejected (i.e., not 
purchased). In region III, p < p* and q 2 q*(p), so that quality is 
observed prior to purchase and the good is subsequently accepted 
(i.e., purchased). 

How do changes in cs and c,7 affect this partition? To answer this 
question one needs to know how changes in cs and cr affect p* and q*. 
The following results are straightforward but tedious and can be 
found in the (unpublished) Appendix to this paper. It is shown there 
that dpz/dcs > 0 and dp*/dcy < 0. Of course V falls as either cg or cp 
rises. Since U[p, q*(p)] = V(1 — B) by definition and U is increasing in 
q, this implies dq*(p)/dcs < 0 and dq*(p)/dcp < 0. In other words, an 
increase in the cost of drawing observations from ¢ will make inspec- 
tion an optimal strategy for more prices while an increase in the cost 
of inspection will make inspection an optimal strategy for less prices. 
An increase in either cost will make more quality levels acceptable for 
any given price. 

Next, consider how changes in cs and cr affect the number of 
observations which must be drawn from ¢ (whether or not quality is 
inspected) before an acceptable good is found. Define 


p 








Fic. 2.—Price-quality combinations, case A: I = reject outright, II = inspect and 
subsequently reject, III = inspect and subsequently accept. 
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1s = [C1 - Clg") | pD/ pap 


and 


In = J "Gtge(p)| PUP ap. 


Here /; is the probability that a random price-quality combination will 
be inspected and rejected, and /; is the probability that a random 
price-quality combination will be inspected and accepted. It is again 
shown in the Appendix that the following hold: 0/;/dcs > 0 and 
OIp/dcs = 0, while O15/dcp 2 0 and O1,/Ocp < 0. 

Consider first an increase in the cost of drawing observations. Since 
p; increases, inspection becomes an optimal strategy for more prices. 
Furthermore, since q*(p) decreases, more quality levels are acceptable 
for any given price. Hence the expected number of observations 
which must be drawn from ¢ before an acceptable good is found 
(1/s) falls. The effect of an increase in cg on Ip is ambiguous, how- 
ever, because the decrease in q*(p) counteracts the increase in pe 
rather than reinforcing it. 

Precisely the opposite happens when the cost of inspection in- 
creases. Since p7 falls, inspection becomes an optimal strategy for less 
prices. Furthermore, since g*(p) still decreases, fewer quality levels are 
acceptable for any given price. Hence I, falls. Since the effect on Ts is 
ambiguous, it is impossible to assert that an increase in the cost of 
inspection reduces the expected number of observations which must 
be drawn from ¢ before an acceptable good is found. 


B. Inspection and Evaluation 


It was assumed throughout Section IIIA that €g(0) = cr so that inspec- 
tion dominated evaluation for all prices. Recall that 


Vi1—B) 
cp(0) =|. WV(w | O)dw. 
2(0) 

It is apparent from this equation that 0c;(0)/dc,; < 0 and O0c3(0)/Ocr < 
0 since V is decreasing in either cost and WV(w | 0) > 0 for w close to 
z(0). Hence increases in either cs or cp make it less likely that inspec- 
tion will dominate evaluation for all prices. 

Assume that ¢g(0) < cp. Then two possibilities arise. In the first, 
inspection is optimal for one set of prices and evaluation is optimal for 
another set of prices. In the second, inspection is never optimal. The 


first possibility will be analyzed now and the second will be considered 
in Section IIIC.® 


* Again, formal proofs that ranges of cy and Cr exist such that both possibilities occur 
have been omitted. The text following identifies two necessary conditions for inspection 
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T(p) 
| B(p) 
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Fic. 3.—Definition of pgr and pF 


In order for inspection to be optimal for one set of prices and 
evaluation to be optimal for another set of prices two conditions must 
be met. Assuming cg(0) < cz, the first can be stated as follows. 

ConpiTI0n 1: The cost of inspection must not be so great that 
evaluation dominates inspection for all prices. 
When condition | holds, since B'(p) < T'(p) < 0, there exists a finite 
price pr such that evaluation dominates inspection for p < pyr and 
inspection dominates evaluation for p = pp. The critical price is 
defined by T(pgr) = B(pir) (see fig. 3). 

Assuming condition | holds, the second condition can be stated as 
follows. 

CONDITION 2: The expected return to search must be low 

enough that for some prices inspection dominates drawing 

another observation from @. 
Condition 2 requires that for some p > p37, T(p) > V. However, as in 
case A, it might be that T(p) > V for all p = p3,. Assume this does not 
hold. Then there exists a finite price p} such that inspection is optimal 
for p © [p3r, p7], and drawing another observation from ¢ is optimal 
for p 2 p}. Again, as in Section IIIA, the critical price is defined by 
Lip iaav (see figs 3) 

Overall, the situation dealt with in this case is the most interesting of 
the model because it shows that quality can be a search characteristic 
for some prices and an experience characteristic for other prices. 
That is, the configuration of utility, search costs, inspection costs, and 
the joint distribution of price and quality are such that inspection, 
evaluation, and drawing another observation from ¢ are all optimal 
strategies for various prices. In general, p7, per, and q* partition R, X 
R, into five regions (see fig. 4). As in case A, region I includes prices 


to be optimal for one set of prices and evaluation to be optimal for another set of prices. 
There seems to be little value in making these more formal. 

® Again, since 0W(w| p)/dp > 0 only implies B’'(p) < 0, the solution to the equation 
T(p) = B(p) may not be unique. In this case {p | T(p) = B(p)} = [pr, ©) where pz = inf 
{p|T(p) = B(p)}. If pir is defined in this fashion, it satisfies the formal requirements 
stated in the text. 
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7a (p) 
7 





Fic. 4.—Price-quality combinations, case B: I = reject outright, Il = inspect and 
ultimately reject, III = inspect and ultimately accept, IV = purchase once and only 
once, V = purchase once and forevermore. 


for which rejecting the good outright is optimal. In region II, pir < p 
< p% but g < qg*(p), so that quality is observed prior to purchase but 
the good is subsequently rejected; and in region III, p37 < p < px, but 
q 2 q*(p), so that quality is observed prior to purchase and the good is 
subsequently accepted. There are two additional regions, though. In 
region IV, p S per and q < q*(p), so that the good is purchased 
without quality having been observed but is not repurchased. In 
region V, p < pgr and q = q*(p), so that the good is purchased without 
quality having been observed and is repurchased in all subsequent 
periods. 

How do changes in cs and cr affect this partition? As before, to 
answer this question we need to know how changes in cg and cr affect 
pir, p?, and q*(p). As in case A, dpx/dcs > 0 and dp#/dcp < 0. Also, 
dppridcs > 0 and dpjr/dcr > 0. Finally, dq*(p)/des < 0 and dq*(p)/dcr 
< 0. 

An increase in the cost of drawing observations will increase the set 
of prices for which evaluation is an optimal strategy. Furthermore, it 
will increase the set of prices for which either inspection or evaluation 
is an optimal strategy. An increase in the cost of inspection will 
increase the set of prices for which evaluation is an optimal strategy, 
will decrease the set of prices for which inspection is an optimal 
strategy, and will decrease the set of prices for which either inspection 
or evaluation is an optimal strategy. Finally, an increase in either cy or 
cr will make more quality levels acceptable at any given price. 

It is again useful to consider how increases in cy and cr affect the 
probability that a random price-quantity combination will fall in any 
given region. Define 


PBT 
Bs=J> "= Glg*(p) [PDS ap, 
En = J 'Gtg*p)|plitprap, 
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15= J — Clyro) lp seovap. 


and 


PT 
Ip = J Clap) |plftpndp. 


Here I;, Is, Ep, and Ey are the probabilities a random price-quality 
combination will fall in region I, III, IV, or V, respectively. It can be 
shown that 











grees wars y Gp OE 
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While the majority of these partial derivatives are ambiguous in 
sign, a number of interesting observations can still be made. First, the 
probability that a random price-quality combination will be acceptable 
(whether quality is inspected or evaluated) is given by Is; + Ey. Hence 
the expected number of observations needed to locate an acceptable 
good is 1/1; + Es). As before, this quantity is decreasing in cs and 
ambiguous in ¢7. Second, both 0F 5/dc, > 0 and dE ;/dc7 > 0. That is, 
the probability that a random price-quality combination will be pur- 
chased without quality having been observed and subsequently repur- 
chased is increasing in either cost. This is because when either ¢, or ¢r 
increases, the set of prices for which evaluation is optimal increases 
and the set of quality levels which are acceptable for any given price 
also increases. However, this necessarily implies 0F ;/dcs and 0E ,/dcr 
are ambiguous in sign. This last observation is important. When a 
price-quality combination falls in region IV, the good is purchased 
without quality having been observed but is not subsequently repur- 
chased. This “brand disloyalty” is analogous to a job quit in the labor 
market or a divorce in the marriage market. It is crucial to recognize 
that the likelihood of these events does not appear to be systematically 
related to either search costs or inspection costs. This point will be 
discussed in more detail in Section IV. 
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Fic. 6.—Price-quality combinations, case C: I = reject outright, 1V = purchase once 
and only once, V = purchase once and forevermore. 


C. Evaluation Only 


The final case is one in which c,(0) < cz, but one of the conditions of 
case B does not hold.!° Under these circumstances evaluation will 
always dominate inspection for any acceptable price.!! Both this case 
and case A are, in a sense, special cases of case B. Here the relevant 
critical price, pz, is defined by B(p3) = V (see fig. 5). The elements of 
the associated partition correspond to regions I, [V, and V of case B 
(see fig. 6). Of course cr has no effect on this partition. The effects of 
Cs are identical to those in case B (where pz now replaces pj7). 


IV. Applications 


The model analyzed in this paper has obvious analogues in the labor 
market and the marriage market. The product market has been used 
as the setting to this point because much of the relevant literature 
deals with consumer behavior (e.g., Nelson 1970; Lippman and 


© This will certainly be the case when cr is large enough, since U is bounded. 
"That is, B(p) > T(p) for any p such that V < max {B(p), T(p)}. 
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McCall 1979a; Wilde 1980; and Hey and McKenna 1981). However, 
many of the important qualitative implications of the model emerge 
more sharply in the labor market than in the product market. 

The labor market analogue concerns an unemployed worker 
searching for a job. This individual pays a search cost in order to 
sample vacancies, but only the wage rate is observed. Nonwage char- 
acteristics can be observed either by paying an inspection cost or by 
taking the job. 

Two aspects of the labor market analogue are of primary inter- 
est—unemployment and the quit rate. Unemployment is related to 
the expected duration of search. The quit rate is related to the 
probability that a job which is accepted will subsequently be rejected. 

Suppose that the distributions of net utility associated with jobs, 
conditional on the wage rate, are stochastically decreasing in the wage 
rate. Then, in the most interesting case, there will be a range of low 
wages for which renewing search is optimal, a range of intermediate 
wages for which inspection is optimal, and a range of high wages for 
which evaluation is optimal (see Sec. IIIB above). 

The effects of an increase in information costs on the duration of 
search seem straightforward. An increase in the cost of search makes 
both inspection and evaluation more desirable alternatives. Hence the 
duration of search should fall. An increase in the cost of inspection 
will likely have ambiguous effects on the duration of search because it 
makes evaluation a more desirable alternative, but it also makes in- 
spection a less desirable alternative. 

The effects of an increase in information costs on the quit rate are 
less obvious. The argument would seem to go as follows: A quit 
requires that two events occur: (1) a wage rate is observed for which it 
is optimal to take the job without observing its nonwage component 
first and (2) the nonwage component turns out to be too low so that it 
is optimal to quit and renew search once it is observed. But an increase 
in either search costs or inspection costs makes evaluation optimal for 
more wages. In particular, there are ower wages for which evaluation 
is now optimal. These jobs must have higher nonwage components in 
order to be acceptable; that is, for them the probability of turnover is 
higher. Hence, the overall probability of a quit should increase as 
either cost increases. 

Unfortunately, these heuristics are incomplete because neither con- 
siders the fact that an increase in either cs or cr will decrease the 
return to search, making lower values of the nonwage component 
acceptable at any given wage. While this reinforces the argument 
regarding the relationship between information costs and the dura- 
tion of search, it weakens the argument regarding the relationship 
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between information costs and the quit rate. In fact, an increase in 
either cost could either increase or decrease the quit rate.’ 

To justify these assertions formally the model of Section HI must be 
used. Consider first the duration of search. There are really two 
measures which are of interest. First, E.; + I; gives the probability that 
a random price-quality observation will ultimately be acceptable (re- 
gardless of how quality is observed). Second, Ey + Es + Is gives the 
probability that a random price observation will cause search to cease 
(although perhaps only temporarily). It was stated in Section III that 


OCs OCr 


= 
It can also be shown that 


O(ErtEstIs). 9 ang SErtEs + Is) 


OCs OC eee 


Purchases of goods for which quality has not been observed are really 
part of the search process. Hence 1/(Es + Js), which might be called 
the “pure duration of search,” is the proper expression for the ex- 
pected duration of search. However, empirically it would often be 
impossible to differentiate between observations which fall in region 
IV and observations which fall in region V. In other words, the 
observed duration of search would often correspond to 1/(Ey + Es + 
Is), which might be called the “effective duration of search.” 

Similar problems arise with respect to turnover. In fact, there are 
three measures of turnover embedded in this model, one ex ante and 
two ex post. The ex ante measure is simply Ff; it gives the probability 
that a random price-quality observation will be purchased once and 
only once. One ex post measure is what might be called the “pure 
failure rate for evaluation,” Ep = E,p/(Ep + Es); it gives the conditional 
probability that a good will be rejected given that it is purchased 
without quality having been observed. However, empirically it would 
often be hard to differentiate between observations which fall in 
region III and observations which fall in region V. Hence the other ex 
post measure is what might be called the “effective failure rate for 
evaluation,” Ey = E,l(Ep + Es + Is); it gives the conditional probabil- 
ity that a good will be rejected given that it is purchased (regardless of 
whether quality is observed before purchase or after purchase). It 
turns out that none of these measures is systematically related to 
either search costs or inspection costs. 


® See Lippman and McCall (1979) for an extensive discussion of these points ina 
related model. 
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The third application of this theoretical framework is to the mar- 
riage market. The marriage market analogue concerns an unwed 
individual searching for a marriage partner. This individual pays a 
search cost in order to sample potential partners, but only some 
characteristics are observed. Other characteristics can be observed 
either by paying an inspection cost or by getting married. 

The aspect of the marriage market analogue which is of primary 
interest is dissolution. The most complete analysis of the relationship 
between information costs and probability of dissolution is provided 
by Becker, Landes, and Michael (1977). These authors consider two 
cases, one in which remarriage is impossible and one in which the 
remarriage market is identical to the marriage market. 

Consider first the case in which remarriage is impossible. When 
remarriage is impossible, the value of dissolution is a constant. In the 
model analyzed in this paper an analogous assumption is that the 
value of not repurchasing a good is a constant, say V. Then instead of 
(3), B(p) would be defined as 


B(p) = E[U(p, Q) + pk(p), (3') 


where 


kip) = Volgp) |p +f SED og | p)dq, 

a*(p) B 
and q*(p) is defined by U[p, g*(p)] =V(1 — B). The definition of T(p) 
would remain as in (4). This modification affects the comparative 
statics of the model in a straightforward way; for p < pz, dq*(p)/dcs = 
0 = dq*(p)/dc7. Hence as before, OE ;/0cs > Oand 0E5/dcr = 0, but now 
0E,/dc; > 0 and 0E;/0cr > 0. Furthermore, as before, 0Fs/Ocs < 0 
and 0E£,,/0cp > 0, but now dEs/Ocp < 0 and OE ,x/Ocr > 0. However, it 
remains true, even when the value of not repurchasing a good is 
constant, that E, and Ey bear no systematic relationship to informa- 
tion costs. 

These results are, on the surface, consistent with those of Becker, 
Landes, and Michael. These authors assert that because “the proba- 
bility of entering into a mismatch” would be greater, “an increase in 
either the cost of intensive or extensive search would increase the 
probability of dissolution” (1977, p. 1150)."° The definition of “prob- 
ability of dissolution” which these authors use to arrive at _this 
theoretical conclusion is apparently Ey. The problem is that Ep is 
unobservable (it is E » which is observed), and even when remarriage is 


‘8 Intensive search corresponds to inspection and extensive search corresponds to 
drawing another observation from @. Hence the cost of intensive search is cp and the 
cost of extensive search is Cg. 
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impossible, there is no systematic relationship between E, and either 
Cg OF Cy! 

The situation is even more difficult when the remarriage market is 
identical to the marriage market. Here, just as in the labor market 
analogue, none of the partial derivatives relating information costs to 
turnover can be signed. 

These are, of course, somewhat negative observations. In addition, 
some positive conclusions can be drawn from this analysis. It has been 
shown that an increase in search costs always decreases the duration 
of search. An increase in inspection costs will generally increase the 
duration of search if the effect on q*(p) is small relative to the effect 
on par or pr. This will be the case when the variation in utility due to 
quality is small relative to the variation in utility due to price. 

There appear to be no general conclusions to be drawn regarding 
the relationship between search costs and turnover. However, as 
above, an increase in inspection costs will generally increase turnover 
if the effect on q*(p) is small relative to the effect on Par OY pz. This is 
again the case when the variation in utility due to quality is small 
relative to the variation in utility due to price. 


V. Conclusion 


This paper has established a number of strong results which go 
against the grain of the extant literature. These results obtain because 
goods are viewed as multicharacteristic composites in which indi- 
vidual characteristics have specific informational properties. 

A number of extensions are possible. An obvious one is to consider 
goods which are described by more than one nonprice attribute. In 
this case the consumer would need to decide which attributes to 
observe before purchase and which to observe after purchase. At each 
stage, reservation levels would be defined as functions of the actual 
levels of those attributes already observed. Finally, the consumer 
would also need to decide in which order to observe attributes. A 
model of this nature would have important applications in psychology 
and consumer research, as well as economics. However, one might 
argue that actual consumers do not use optimizing strategies as com- 
plicated as this model would require. Rather, consumers would use 
various satisficing strategies. In fact, one might apply the same argu- 
ment to the model analyzed in this paper. 

There is much to be said for this argument, but satisficing strategies 
are even less well understood than optimizing strategies. Further- 
more, most empirical work on the duration of search and turnover is 
based, implicitly or explicitly, on simpler versions of the optimizing 
model developed in this paper (e.g., Becker, Landes, and Michael 
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1977). In any event, the issue offers a number of intriguing topics for 
future research. 
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Macroeconomic Policy, Exchange-Rate 
Dynamics, and Optimal Asset Accumulation 





Maurice Obstfeld 


Columhia Unwersity 


The paper develops a model of exchange-rate and current-account 
determination for a small economy peopled by infinitely lived, 
utility-maximizing households. In this setting, a central-bank pur- 
chase of foreign exchange has no real effects when central-bank 
foreign reserves earn interest at the world rate and the proceeds are 
returned to the public. In contrast, an increase in the monetary 
growth rate does have real effects, even in the long run. The model 
developed here implies that an increase in government spending 
may lead to a surplus on current account. The external adjustment 
process predicted by the model is one in which consumption, real 
balances, and external assets all rise or fall simultaneously. 


Introduction 


This paper studies exchange-rate determination and the external 
adjustment process in a small economy peopled by infinitely lived, 
utility-maximizing households. The setting is similar to one proposed 
by Kouri (1976), in that agents are assumed to consume a single good, 
to hold domestic money, and to have access to a world market in 
consumption loans. But the approach departs from those prevalent in 
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recent balance-of-payments models by basing saving behavior and 
money demand on explicit, intertemporal utility maximization.! 

The analytical framework is adapted from the seminal work of 
Sidrauski (1967) and employs the perfect-foresight equilibrium con- 
cept explored by Brock (1974), Calvo (1979a), and Fischer (1979). In 
contrast to these contributions, however, the model developed below 
allows agents’ subjective rate of time preference to be endogenously 
determined in a manner suggested by Uzawa (1968) rather than 
fixed.? This modification permits the small economy we study to attain 
a stationary long-run equilibrium under conditions of perfect capital 
mobility.* 

The plan of the paper, and its main results, are as follows. Section I 
describes the structure of the economy and the typical household’s 
maximization problem. Particular attention is paid to the budget 
constraints of the central bank and government. Section II closes the 
model by imposing continuous money-market equilibrium and en- 
dowing agents with perfect foresight concerning the future paths of 
the price level and government transfer payments. The resulting 
dynamic system possesses a unique stationary position and a single 
convergent path, along which consumption, real money balances, and 
the stock of foreign claims all rise or fall simultaneously. This conver- 
gent saddlepath is an equilibrium path and is the focus of the analysis 
that follows. 

Section III analyzes the effects of some unanticipated mac- 
roeconomic policy actions. A central-bank purchase of foreign ex- 
change is found to have no real effects when official foreign reserves 
earn interest that is distributed to the public. In contrast, an increase 
in the monetary growth rate does have real effects: It occasions 
exchange-rate overshooting (in Flood’s [1979 ] sense) and a current- 
account surplus and leads, in the long run, to higher levels of con- 
sumption and foreign claims. Finally, the model implies that an in- 
crease in government spending may induce a surplus on current 
account, contrary to the usual presumption. A deficit is possible when 
government spending produces a public good having a high marginal 
value to consumers. The explicit specification of agents’ objective 


* Research along similar lines is reported in papers by Calvo (1979b, 1980), Helpman 
(1979), Razin (1979), and Stockman (1980). Obstfeld (1980) describes a two-country 
extension of this paper’s model. 

* Endogenous time preference has been discussed in the trade literature in papers by 
Calvo and Findlay (1978) and by Findlay (1978). Kouri (1980) adopts a consumption 
function suggested by Uzawa’s (1968) theory. The present approach is applied in a 
somewhat different context in Obstfeld (1981). 

* This could be accomplished also by allowing bonds (in addition to real balances) to 
enter consumers’ utility functions, as in Calvo (1980). 
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functions is particularly useful in Section III, for it permits a natural 
welfare analysis of alternative macroeconomic policies. 

Section IV contains concluding remarks. Appendices supply the 
technical details justifying some of the paper’s assertions. 


I. The Model 


We consider an open economy inhabited by identical households, 
each maximizing its welfare over an infinite lifetime. As in Kouri 
(1976), there is a single, perishable, consumption good, and house- 
holds’ financial wealth is divided between domestic fiat money and 
internationally traded bonds denominated in foreign currency. The 
economy is small, and it can therefore influence neither the foreign 
currency price of the consumption good, P*, nor the world bond rate, 
r, both of which are assumed constant. The link between the domestic 
price level, P, and P* is provided by the relationship P = EP*, where 
E denotes the domestic money price of foreign exchange. This ex- 
change rate is allowed to float freely by the monetary authority. On 
the assumption that foreigners do not hold domestic money, E adjusts 
to maintain equality between the real money supply and domestic real 
money demand at each instant. We adopt the normalization P* = 1. 

The number of households in the economy is, for convenience, also 
taken to be 1. The household’s instantaneous utility is a separable 
function U(cjm) = u(c) + v(m) of consumption, c, and real money 
holdings, m, defined as nominal money holdings, M, deflated by the 
home price level, m = M/P. The functions u(-) and v(-) are nonnega- 
tive, strictly concave, and twice continuously differentiable, with the 
property that 

lim u’(c) = lim v'(m) = ~, (1) 
c>0 m—0 

Assumption (1) eliminates the possibility of corner solutions. It is 
clear, in addition, that the derivatives u’(-) and v'(-) are every- 
where strictly positive. 

The household seeks to maximize the discounted sum of future 
instantaneous utilities, 


V a U (¢.,m,)e “dt. (2) 


The discount factor A; is defined by 


t 
A; = | dds, (3) 
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where 6, is the instantaneous subjective discount rate at time s.4 Fol- 
lowing Uzawa (1968), we postulate that 6; is a function of utility at 
time s, 


8s = 8[U(c,,m,)], (4) 
and that the function 8(-) is positive and satisfies 
6'(U) > 0, &"(U) > 0, &(U) — US'(U) > O. (5) 


The first two conditions listed in (5) are in no way implied by the 
hypothesis of endogenous time preference but are imposed because 
they are analytically convenient.® The last of the conditions ensures 
that the present discounted value U/8(U) of a stationary utility stream 
U is increasing in U. 

At each moment t, the household must allocate its income between 
current expenditure and the accumulation of real wealth. Real out- 
put, y, is taken to be exogenous and fixed. The other components of 
expected disposable income are interest payments from abroad, equal 
to the world bond rate, r, times the real value F : Of the family’s net 
foreign claims; expected real transfers from the government, 7;; and 
expected capital gains on real balances, equal to —7ym,, where 7, 
denotes the expected inflation rate. Defining the household’s real 
assets at time ¢ (excluding the capitalized value of expected future 
transfers) as 


a, = 2 + m + Fi, (6) 


the flow constraint linking asset accumulation to saving may be writ- 
ten as 


a; her: ale Tp a CF a (7; oie r)M:, ) 


while the intertemporal budget constraint takes the form 


co 


i) Clee (are n)\mldt = ag ah er, dt. (8) 
4 0 


Restriction (8) is important in the present setting of perfect capital 
mobility, for it rules out the possibility that the household can attain 
unbounded utility by borrowing arbitrarily large sums in the world 


* When 6, = 6, a constant, for all s, we have the usual discount LACtOn Ay = OF; 

* The nature of this convenience becomes apparent in applying the maximum prin- 
ciple to solve the household’s lifetime problem and in proving the existence of a stable 
perfect-foresight equilibrium path converging to the economy’s stationary state. These 
points are taken up again in context. A general discussion of the time preference 
concept is found in Koopmans, Diamond, and Williamson (1964). 
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capital market and meeting all interest payments through further 
borrowing. As Appendix B shows, (7) and (8) confine the household 
to paths along which 


co 
at +f eS ds = 0 
t 


for all ¢, so that household net worth is always nonnegative. 

The household’s problem is to find {c¢;,m,} maximizing lifetime 
welfare V subject to the constraints (3), (7), and (8), given an initial 
level a) of real assets. The calculations are simplified by changing 
variables from ¢ to A (“psychological time”) using the fact, implied by 
(3), that dA = 6[U(c;,m,)]dt. The household’s problem may then be 
expressed as that of choosing time paths for consumption and real 
balances that satisfy (8) and® 





maximize ; S[U (cm) dA 
aig Se ON auton Seton ae 
subject to Te 5[U (c,m)] 


Necessary conditions for an optimal program are given by the 
maximum principle.” These require that c and m be chosen so that for 
each value of the discount factor A, the Hamiltonian 


U (c,m) wah 6G lar oa] 


fe Cel SU (em) ] 9) 


is maximized. In (9), A = A, is the co-state variable, which is inter- 
preted as the shadow value, in utility terms, of real assets. The implied 
first-order conditions (at the interior maximum) are 


(1 — (8'/8){U + Afra +7 -—c —(4 +1)m]})U= 2, (10) 


(1 — (8'/8){U + Afra + 7 -—c — (7 + 1)m)})Um_ =A(T +7). (AY 


An additional necessary condition is that the shadow price of assets 
evolves according to the law, 


oe (A), (12) 


Equations (10) and (11) imply the usual necessary condition of static 
utility maximization, 


° The psychological time subscript, A, is omitted for the sake of notational simplicity. 

7 See Arrow and Kurz 1970. If U(c,m) is strictly concave and both consumption and 
real balances are normal goods, the conditions listed in (5) imply that the maximand 
U(c,m)/8(U (c,m)] is strictly concave. 
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x(Cym~) =U, (cm, /U (cm, = 7, +7, (13) 

while (12) implies that the time derivative of Ax is given by® 
Ne = Ae{SLU (cem)] — r}. (14) 


It remains to specify the behavior of the government and the 
central bank. We consolidate the balance sheets of these two entities 
and work with a single public-sector budget constraint. A key as- 
sumption is that the stock of central-bank foreign reserves earns 
interest at the world rate r. 

The public sector consumes goods and makes (net) transfers, 
financing any deficit through money creation. Thus, all government 
debt is purchased by the central bank, none by private citizens. Let- 
ting g denote the (constant) level of real government consumption per 
household, the public sector’s budget constraint is g = M,/P,+7rR — 7 
= (M,/M,)m, + rR — 7, where R denotes the (constant) level of 
central-bank foreign reserves, measured in terms of output, and 7;, as 
before, denotes net real transfers. 

We shall assume that the central bank varies 7, in such a way as to 
hold the rate of monetary growth M,/M, constant at bu.” The level of 
transfers is then determined according to the formula 


T= pm TR = g. (15) 


A final assumption is that p is always chosen so that w + r > 0. This 
inequality guarantees the existence of a stationary state. 


II. Perfect-Foresight Equilibrium Dynamics 


The model of the previous section is closed by the assumption that the 
path of the economy is a perfect-foresight equilibrium path, in the 
sense of Brock (1974). Let {P;,} be a (differentiable) price-level path 
and {Py} = {we"M, + PywR — Pg} the associated path of nominal 
transfer payments from the government, where M, denotes the 
nominal money stock at ¢ = 0. Acting on the belief that these paths of 
the price level and nominal transfers will prevail, the household can 
calculate expected real transfers {7;} and determine optimal paths for 


* Only necessary conditions (10) and (11) differ from what they would be in the 
standard intertemporal optimization framework, where 6, = 5, an exogenous constant. 
In the standard setup, (10) is replaced by U, = d. But in the present context, an increase 
in today’s consumption affects not only today’s instantaneous utility but also the dis- 
count factor applied to all utilities enjoyed in the future. Note also that if the instan- 
taneous subjective discount rate 6 were constant, necessary condition (14) would rule 
out the possibility of a stationary state unless it so happened that 6 and r were equal. 

* This assumption is crucial for the foreign-exchange intervention “neutrality” prop- 
osition derived below and differs from the one made by Kouri (1976), who takes 
government consumption to be the residual item. 
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its consumption rate and real balances, subject to (7) and (8) with 
initial real assets ad) = (y/r) + Fo + (Mo/Po). The path {P;} is a 
perfect-foresight equilibrium path if the desired path of real balances, 
{mm}, equals the actual path, that is, if 


m, = M,/P, = e“MJP* (16) 


for allt = 0. 

The necessary conditions derived in the previous section may be 
used to find the collection of price-level paths {P,} consistent with 
perfect foresight and individual optimality. The differential equa- 
tions (7) and (14), together with the conditions (6), (10), (13), (16), 
and 


Te = PJP, (17) 


must be satisfied by any perfect-foresight equilibrium trajectory of the 
economy. We now investigate the restrictions these requirements 
place on the possible equilibrium paths of bonds F;, consumption ¢, 
real balances m;, and, by implication, the home price level. Recalling 
our earlier convention that the world price level P* = 1, we shall 
identify the path of the domestic price level with that of the exchange 
rate, E. 

Differentiating equilibrium condition (16), we find that desired real 
balances must satisfy the relation 


PIP; = pw — (m;/m) (18) 


along an equilibrium path. Combining (17) and (18) with necessary 
condition (13) yields 


m, = [w+ r — x(cy,mz) ly. (19) 


The differential equation (19) must govern the evolution of real 
balances in perfect-foresight equilibrium. 

The flow constraint (7) may be written in the form m, + F; = y + rF; 
+ 7; — Cc; — 1m. When we use (17), (18), and the public-sector budget 
constraint (15), this differential equation becomes 


Fp=y+r(F, +R) — a —g, (20) 


the familiar relation between the current and capital accounts under a 
floating exchange rate. Equation (20), which equates the home coun- 
try’s foreign lending to the excess of domestic income over absorption 
by the domestic private and public sectors, describes the equilibrium 
path of foreign bond holdings. Note that because foreigners do not 
hold domestic money, the variable F; is predetermined by the past 
history of the current account. It can jump instantaneously only 
through central-bank purchases and sales and must otherwise adjust 
slowly over time. 
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It remains to derive the differential equation governing movements 
in consumption. Using the necessary condition (10), we may solve for 
the co-state variable \,; with the help of (15) and (17)-(19): 


Ay = A(C,m,F, + R) = 


Ld(U;) — U,8'(U;)\w' (cr) 
0), + 4{y + 7(F, +R) + [a + 7 — x(c.,m,)|m, — c, — g}o'(U)u'(c,) 


Differentiating A with respect to time and applying (14) yields \; = \,¢; 
+ Anim, + ApFy = dy {6[U(Cz,m,)] — 7}, which may be solved for the time 
derivative of consumption, 


Ex = NS*(ALL{ SLU (cym,)] — 1} — Ante — Avs) = (eum, F, + R). (21) 


Equation (21) is the final differential equation of a three-equation 
system.'° Together, equations (19)—(21) must describe the evolution 
of the economy along any perfect-foresight equilibrium path. 

Not every path satisfying (19)-(21) is a perfect-foresight equilibrium 
path, however, for the conditions from which these paths are derived 
are necessary for optimality, but are not, in general, sufficient. For the 
balance of this paper, we shall focus attention on paths that bring the 
economy to the stationary state (¢,m,F), defined by the conditions 
W(em,F +R) = w+r—x(ém) =y +7r(F + R) —6 — g = 0. Asis easily 
verified, the stationary state exists and is unique when, as we have 
assuIneds pty = 0: 

Appendix A proves that the system (19)—(21), when linearized in a 
neighborhood of the stationary state, possesses a single negative char- 
acteristic root, and so a unique convergent path. Appendix B proves 
that this path is a perfect-foresight equilibrium path: The choices of 
real balances and consumption it dictates are optimal choices from the 
household’s standpoint, given the implied initial value of real assets 
and the implied anticipated paths of prices and transfers. We there- 
fore assume that for any value of the predetermined stock of foreign 
claims, the equilibrium price level and consumption rate are those 
placing the economy on the path converging to the stationary or 
long-run equilibrium.” It should be noted that this convergent 
saddlepath satisfies the household budget constraint (8), for, along it, 


J er le, & aye 1m, = 7; \dt =a, = lima,e" = ay. (22) 

0 tea 
To find the saddlepath, we let 6, denote the system’s negative 
characteristic root and @ = [@1,, @21, @3:]’, an eigenvector belonging to 


‘© The equation giving ¢ cannot be defined if A, = 0. It is shown in Appendix A that \, 
< 0in a neighborhood of the system’s stationary state. In fact, it can be shown that A, < 
0 globally if hyperdeflations are ruled out. 

" It has not been demonstrated that perfect-foresight equilibrium is locally unique. 
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6,. Any convergent solution of the linearized system must take the 
form 


SI 


Ce — € = Wyk exp (6,¢), 


mM, — M = Wok exp (0,t), 


Es 


W31k exp (6,¢), 


where & is an arbitrary constant determined by the value of F at time t 
= 0.” Differentiating, we find that 


é¢ = O1(c¢ — €) = 01(@4;/@51)(F; — F), (23) 
My = 0,(m, — M) = O4(Wo,/w5,)(F; — F), (24) 
F, = 0,(F, —F), (25) 


when the economy approaches the stationary state. The saddlepath of 
the system is simply the intersection of the two hyperplanes defined 
by equations (23) and (24). 

The importance of equations (23)—(25) is that they allow us to 
characterize the motion of consumption, real balances, and external 
asset holdings as the economy converges to long-run equilibrium. As 
is shown in Appendix A, the components of the eigenvector @® must all 
be of the same sign, and an immediate consequence is that the three 
variables must always rise or fall together along the saddlepath." This 
is a key property, for it allows us to trace the path taken by the 
economy in response to various policy surprises. However, one par- 
ticular implication of this result is important enough to be mentioned 
before we turn to such exercises. When the current account is in 
surplus (F, > 0), real balances must be rising, and when it is in deficit 
(rs 0), they must be falling. Thus, the exchange rate’s depreciation 
rate will fall short of the rate of monetary growth when there is an 
external surplus and will exceed it when there is a deficit. This 
fundamental relationship between exchange-rate depreciation and 
the current account appears in models developed by Kouri (1976), 
Calvo and Rodriguez (1977), and others. But it emerges here from 
the hypothesis of optimizing behavior and not asa consequence of the 
assumed properties of consumption and money-demand functions. 

Figure 1 shows how the economy’s long-run equilibrium is found. 
From (14), stationary-state utility U is determined by the equality of 
the marginal rate of time preference and the world bond rate," 


2 See Hirsch and Smale 1974. 

’® Calvo (1979a) provides a similar characterization of the saddlepath arising in the 
Brock-Sidrauski perfect-foresight model. 

4 From (10), A = {[8(U) — U8'(U)\/8(U)}u'(é). The last inequality in (5) therefore 
implies that A is strictly positive. This justifies condition (26). 
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m 


U=8 (r) 





Fic. I 


é(U) =r, (26) 


and so, stationary-state consumption é and real balances m must satisfy 
U(c,m) = U, as shown in the diagram. The second requirement on the 
point (¢,m) is that it lie on the locus of points consistent with m = 0; the 
latter is, by (19), the Engel curve associated with an opportunity cost 
+ r of holding real balances. It slopes upward from the origin. Given 
the value of ¢ determined by U and w + 1, F is determined by the 
relation F = [(¢ + oF y)/r] — R. The locus labeled F = 0 in figure | is 
vertical at the unique private consumption level consistent with 
current-account balance given stationary-state bond holdings F. An 
increase in F would increase long-run domestic income and so shift 
this schedule rightward. 


III. The Effects of Macroeconomic Disturbances 


This section studies the impact and long-run effects of intervention in 
the foreign exchange market, an increase in the rate of monetary 
expansion, and an increase in real government consumption. The 
macroeconomic disturbances are assumed to take the public by sur- 
prise, but they are permanent and lead to no expectation of future 
policy actions. The economy’s initial position is the stationary state. 


Foreign Exchange Intervention 


The central bank intervenes in the foreign exchange market by pur- 
chasing foreign claims from the public with domestic money. The 
transaction accomplishes a transfer of interest- earning assets from the 
public to the central bank as well as an increase in the nominal money 
supply. But the sum F + R of privately owned bonds and central-bank 
reserves is unaffected by this transfer. 

Now, equations (20) and (21) involve only the economy’s aggregate 
claims on the rest of the world; the distribution of their ownership 
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between the public and the central bank is irrelevant. Thus, the 
long-run value of F simply declines from F to F — AR asa result of the 
intervention, while ¢ and m are unchanged. Because the stock of 
privately owned foreign assets is immediately at its new long-run level, 
consumption and real balances must, by (23) and (24), remain at their 
respective stationary-state levels ¢ and m. The monetary authority’s 
action occasions a rise in the price level (a depreciation of the ex- 
change rate) exactly proportional to the increase in the nominal 
supply of money. It has no real effects. Money creation accomplished 
through a purchase of foreign exchange has the same impact as a 
“helicopter” money-supply change of equal magnitude. 

It is sometimes asserted that there zs a difference between these two 
financial policies, in that an official purchase leads to a current surplus 
as domestic residents seek to restore their holdings of external claims 
to the original level.'* The difference disappears here because foreign 
assets held by the central bank continue to earn interest at the rate 7, 
while households capitalize all transfers from the government (see 
Mundell 1971, chap. 1). By the government budget constraint (15), 
the stream of interest earnings taken away from the public through 
central-bank intervention is returned in the form of transfers. 
Households therefore suffer no fall in permanent disposable income 
when a portion of their bond holdings is transferred to the central 
bank. 

This neutrality result would clearly break down if, for example, 
reserves were held in some barren, non-interest-bearing form, or if 
increased central-bank earnings were used to augment government 
consumption rather than transfer payments. That intervention has 
real effects in these circumstances is to be expected, for it involves real 
fiscal measures in addition to a neutral change in the nominal supply 
of money. 


An Increase in the Monetary Growth Rate 


Figure 2 depicts the effects of an increase in the rate of monetary 
growth, yw. As (26) shows, the long-run instantaneous utility level of 
the representative household is unchanged by the disturbance. But as 
the rate of monetary growth and the inflation rate coincide in the long 
run, the disturbance raises the opportunity cost of holding money in 
the stationary state. Thus, the household economizes on real balances 
and consumes more in the new long-run equilibrium. This is repre- 
sented in figure 2 by the downward shift of the Engel curve along 


15'See, e.g., Kouri 1978. 
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which real balances are stationary. The rise in long-run consumption 
entails a rise in long-run bond holdings. Accordingly, the F = 0 locus 
shifts to the right, so that it passes through the new asymptotic 
equilibrium point, (¢’,m’). 

Knowledge of the long-run effects of the policy shift allows us to 
infer its short-run effects using the properties of the convergent 
saddlepath described in Section II. Because the eventual level of 
external assets is higher after the rise in w, consumption must initially 
decline to bring about the implied current-account surplus. And be- 
cause real balances rise during the course of a surplus and m’ is lower 
than m, the policy also entails a sharp initial fall in real balances and so 
a depreciation of the exchange rate. The economy’s short-run, tem- 
porary equilibrium at t = 0 is thus at a point like 4 in figure 2. 

Both consumption and real balances rise from the levels given by 
point A as the stock of foreign claims approaches its higher, asymp- 
totic level. This implies, in particular, that the rate of depreciation of 
the exchange rate during the transition will fall short of the higher 
rate of monetary expansion. The exchange rate thus exhibits an 
“overshooting” pattern of adjustment, in that the impact fall in real 
balances exaggerates the long-run decline. This reproduces, in an 
optimizing framework, the result obtained by Kouri (1976), Calvo and 
Rodriguez (1977), and Flood (1979) on the basis of assumed aggre- 
gate relationships. 

It is of interest to note that the integral of discounted utilities given 
in equation (2) is lower along the transition path following a rise in 
than it would have been along the original, unperturbed stationary- 
state path. This follows from the last inequality in (5) and the obser- 
vation that the household’s instantaneous utility falls the moment 
increases, rising to its original level only in the long run. Thus, an 
increase in the growth rate of money, in the present setting, has an 
unambiguously negative effect on the welfare of economic agents. 
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An Increase in Government Consumption 


We analyze, finally, the consequences of an increase Ag in real gov- 
ernment consumption. It is assumed initially that government con- 
sumption is wasteful, in that it does not enter into the household’s 
uulity function. An example in which government spending produces 
a public good complementary to private consumption is analyzed 
next. 

In the absence of any change in private consumption from its initial 
level ¢, an increase in government consumption would entail a 
current-account deficit at the initial level, F, of foreign assets. This is 
shown in figure 3 by the leftward shift of the F = 0 locus to F’ = 0, 
which is vertical at the lower level of private consumption compatible 
with external balance the moment after the increase in g occurs. 

The steady-state levels of consumption and real balances are not 
altered by the government’s action, however, for these variables are 
determined, independently of the level of g, by the two conditions, 
6[U(c,m)] = r, x(é,m) = w +r. It follows that the stock of external 
claims associated with the economy’s new stationary state must be 
higher than the initial stock, F: In the long-run equilibrium following 
the increase in public-sector expenditure, the private consumption 
level ¢ can be sustained with no net borrowing from abroad only if the 
economy's privately owned foreign assets rise by the amount Ag/r, so 
that additional interest payments from foreigners finance the addi- 
tional national absorption. 

Because the economy’s adjustment path is noncyclical, the current 
account must go into surplus when the rise in g occurs. Consumption 
thus declines by an amount greater than the increase in government 
spending. And because consumption, real balances, and foreign bond 
holdings rise together along the saddlepath, the exchange rate neces- 
sarily depreciates on impact, reducing real balances in the short run 
so that they may subsequently increase toward their long-run level, m. 
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The point labeled 4 in figure 3 again represents short-run equilib- 
rium at ¢ = 0. 

When the monetary growth rate, mu, is zero, the government's action 
is simply a tax-financed increase in government spending. The 
model's prediction contradicts the standard Keynesian presumption 
that the accompanying short-run fall in disposable income must re- 
duce private saving along with private expenditure, pushing the cur- 
rent account into deficit. In the present framework, private saving 
must actually increase in order that the economy may attain, in the 
long run, the higher level of national income consistent with the 
predisturbance private consumption level. Looked at another way, a 
tax-financed increase in government spending entails a more than 
complete crowding out of private aggregate demand in the short run. 

The preceding discussion has been based on the assumption that 
the level of government spending does not enter the utility function, 
as it would if government consumption resulted in the provision of 
some public good. To illustrate how the analysis might change if this 
assumption were modified, we study a particular example. 

Assume now that the utility function has the form 


U(c,m,g) = u(c,g) + v(m), Ug, Ucg > O. (27) 


According to (27), public and private consumption are complemen- 
tary goods. The stationary values of consumption, real balances, and 
foreign assets are determined by the equations 


6[U(¢m,g)l=r, (28) 
u'(m)/u(C,g) = wh +7, (29) 
F=[(¢€+g-y)r]-R. (30) 


In particular, (28) and (29) imply that ¢ and m are no longer inde- 
pendent of g, as they were in the case of purely wasteful government 


spending. % 
The long-run derivatives of ¢, m, and F with respect to g are: 
dc Zt —1 a op ory 
a Ae —7(D[u,o" + U' (je + r)ttegl, (31) 
& 
oe = 1D“\(t + 1) (tidlieg — tiytice), (32) 
= = Date -. Ug)o = v' (pe ei i) tee ee Wea), (33) 


where D = r[u,v”" + (wu + r)ted'] < 0 and functions beneath overbars 
are evaluated at the initial long-run equilibrium. Only (32) can be 
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unambiguously signed under the assumption u,, > 0.16 In particular, 
if the marginal utility of private consumption is sufficiently low rela- 
tive to that of public consumption, an increase in g may, as (33) shows, 
occasion a current deficit, contrary to the earlier analysis. In this 
situation, private consumption falls by an amount smaller than Ag in 
the short run; however, there is a decline in long-run consumption 
that exceeds Ag. 

In the event that the level of public-good provision happens to be 
initially at the Samuelson static optimum, so that a, = u,, (32) and (33) 
imply that the increase in g must occasion a current surplus and a 
depreciation, as before. Why must stationary consumption rise above 
c — Ag in this case? If, in the long run, consumption were to fall by 
exactly Ag with m unchanged, the economy would be at its long-run 
utility level U, but with v'(m)/u.(¢ — Ag, g + Ag) < w +7, in violation of 
(29). Long-run equilibrium thus requires that real money balances fall 
below m and that consumption rise above ¢ — Ag. 

The welfare analysis of an increase in government consumption is 
straightforward. We note here simply that when u,, > 0 and a, = ties 
an increase in g must decrease the discounted sum of future utilities V 
below U/8(U), for it induces a current surplus and so a short-run 
decline in private consumption greater than Ag. The implication is 
that a level of public expenditure optimal from a static standpoint 
may no longer appear optimal once full account is taken of the 
dynamic repercussions of a change. 


IV. Conclusion 


This paper has developed an Open-economy model based on house- 
hold utility maximization over an infinite lifetime. The model answers 
several questions concerning the effects of macroeconomic policies 
when exchange rates float freely. In some cases, the answers coincide 
with results derived previously on the basis of more or less mechanis- 
tic behavioral assumptions. But in other cases, the answers contradict 
these results. 

In particular, the model implies that money is not superneutral 
in either the short or long runs and places both the exchange-rate 
“overshooting” hypothesis and the association between depreciation 
and current deficits on a firm microeconomic foundation. However, it 


'6 The following considerations imply that dm/dg < 0. Because u,, > 0, the increase in 
g would entail an increase in the marginal utility of private consumption at the initial 
level of c. If m were to rise, v' would fall, and (29) could not be satisfied in the long run 
in the absence of a higher stationary consumption level. But with ¢, m, and g all higher, 
the household’s long-run utility level would exceed U , contrary to (28). Thus, m declines 
and v'(m) rises. 
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casts doubt on the validity of inferences drawn from the Keynesian 
consumption function which, in some form, is central to the classic 
Mundell-Fleming model and most subsequent work. In addition, the 
approach highlights the importance of the government and central- 
bank budget constraints in discussions of the effects of financial policy 
measures. 


Appendix A 


This appendix provides the proof of local saddlepath stability for the system 
(19)—(21) and characterizes the unique path leading to stationary-state 
equilibrium. 

The model involves two state variables, c and m, which may take (unantici-” 
pated) discrete jumps, and a third state variable, F, which is predetermined by 
the past history of external asset accumulation. Thus, to prove saddlepath 
stability, it must be shown that when linearized about its stationary position, 
the differential equation system, 


oS A{S[u(c) + v(m)] — r} — Aplw +r — x(cym)\m — Nee eae Hy) 6 — "| 
Xe 





= (c,m, F + R), 
m= [ut+r—x(cym)|m, 
F=y+r(F+R)-c-—g, 


possesses two positive characteristic roots and one negative characteristic root, 
implying the existence of a unique path converging to long-run equilib- 
rium. Recall that 


Bz: {8 = [u(c) + v(m)]8" pu’ 
OF Age PR) ip Sle) nic ghoul” 
x(cym) = v'(m)/u'(c). 

At the system’s long-run equilibrium (¢,m,F), we have ¢ = m = F =a = 
Olu (c) + v(m)] — r = 0, and, linearizing in a neighborhood of this point, the 
system becomes 

€ = Wel — 6) + Palm — mi) + welF — F), 
m = —x,m(c — C) — X_m(m — m), 
F = -(c—6)+7r(F —F). 


(Functions beneath overbars are evaluated at (Em,F ].) Using the definition of 
long-run equilibrium, we find that 


We = (A/Xe) 5! + (An/Ae)Kert + (Ape), (Al) 
Win = (AIN-)5'I" + (Xm/Ae)Xmm, (A2) 
te = —(Ap/d-)r, (A3) 


where 
5 — (a + 0)8')[ Su" + 8(u')2x.m] — (G+ 0 
§2 





ee) 
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om [6- (a+ 0)5'|( ‘UX yan re babi v)u'v' 88" (A5) 


62 

ine “15 = + 0)5T' WY — 9. (A6) 
fs) 

Now x, = —v'(m)u"(@)/[u' (QP > 0 and Xp = v"(m)/u'(c) < 0; and since U 3 0, 

5 — US’ > 0, and 8" > 0 by assumption, 
Ae! (A7) 

Further, substituting for x, in (A4) yields 
= [5 — (a + v)8'\(6 — samme = (w+ 8)(a'Pdd" (A8) 

Cc 5: 


Because u(-) and v(-) are nonnegative and concave, we have, using (5), 5 — 
5’0'm > § — 08’ > & — (i + 0)’ > O, and (A8) implies immediately that 
hae. 0: (A9) 


We note, finally, that because \ = {5 — [u(é) + v(m)]d'}u'/d, (Al) and (A6) 
imply that 
We i m/Xe)Xom =). (A10) 


Using (A10), the linearized system becomes 





, AXmXem NGO IN en 1 —\pr E 
c = = = C= 6 
Xe Xe Xe 
= i (All) 
m - ey mail 0 Nia ( 
F =] 0 r F-F 


By (A10) and the fact that x,, < 0, the trace of the system’s matrix is positive. 
Since this trace is also equal to the sum of the system’s characteristic roots, 
there must be at least one root with positive real part. 

On the other hand, we may write the system’s determinant as 


D1. = [Xmrmi(Ap/Ae)r] + r(KeM)(NS'T' + XX 77)/Ne — (nM) (Non Ne) Kee 
= [Xm (Ap/d-)r] + r(X-m)(N5'D')/A. < 0. 


Because is the product of the system’s characteristic roots, it must be the 
case that there are either three roots with negative real part or one negative 
root. (Recall that complex roots must occur as conjugate pairs.) Since we 
already know there cannot be three roots with negative real part, there must 
be exactly one negative root. This proves that in a neighborhood of long-run 
equilibrium, our system is saddlepath-stable. 

It remains to provide a characterization of the unique stable path. Let 6, 
denote the system’s sole negative root, and let @ = [@11,@21,@3,]’ denote an 
eigenvector belonging to 6,. As was shown in the text, c, m, and F must rise or 
fall together along the saddlepath if the components of @ are all of the same 
sign. We now prove that this is the case. 

First, we note that if one of the components of @ is zero, all must be zero, 
and so all components must be strictly positive or negative. To see this, 
suppose @,, = 0. Then, using the last equation of (A11), —@,, + 703; = 0,051, 
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and so @3, = Q. But using the second equation of (A11), we find that w), = 0as 
well; and so » = 0. Similar arguments dispose of the possibilities that w., = 0 
anaathiatcs7—s 0: 

So suppose w3; > 0. From (A11), —@,; + r@3,; = 0,03, < 0, and so it must be 
the case that w,, > 0. Again using (A11), we see that (—x,m)w,, + (—X»mM)wo; = 
6,@.,, implying @.,; = [(—x,.m)/(A, + X,m)]w,,. Thus, it must be true that @», > 
0. In this case, therefore, all components of @ have the same sign. 

In the case w3, < 0, an identical argument shows that @;;, @2; < 0 as well. 
This completes the proof. 


Appendix B 


This appendix contains a proof that the unique convergent path of the system 
(19)—(21) is a perfect-foresight equilibrium path. The minor technical issue in 
the proof is that the state variable a, may assume negative values, so that the 
sufficiency theorem for optimal controls (see Arrow and Kurz 1970, p. 49) 
appears inapplicable. We show that the intertemporal budget constraint (8) 
implies that a, must be bounded from below when consumers expect the path 
of prices implied by the convergent solution to (19)—(21). This fact, in the 
present context, implies that the sufficiency theorem remains applicable. 
Substituting the flow constraint (7) into (8) and integrating by parts, we 
obtain the inequality 
lim “a,e-* = 0. (B1) 


to 


Condition (B1) may be used to prove the inequality 


a Pl é = 7 .ds = 0, for allt, (B2) 
t 


where hatted values are those expected to prevail along the convergent path. 
Assume (B2) fails at time t’. Integration of (7) implies that for ¢ > ¢’, a; must 
satisfy 


t 
ee +f es” [F. —c, — (7, + r)m, lds, (B3) 
ae 


for any path of consumption and real balances. Multiplying both sides of (B3) 
by e- and taking limits as t > ™ yields the equality 


lms — olay =| ii ats aa or wel Cee r,s. (B4) 
5 


toa 


Because consumption and real balances are nonnegative, the right-hand side 
of (B4) must be negative if (B2) is false at ¢’.'” This contradicts (B1), showing 
that (B2) must hold at each instant. 


Define 
k = sup {J eras), 
t ie 


The number £ is finite as a result of the boundedness of the sequence {7;} 
along the convergent path. From (B2), 


a, + k = 0, for allt. (B5) 


'7Qur assumptions ensure that 7 + r > 0 at all mes. 
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It is now straightforward to apply the sufficiency theorem. Redefine the state 
variable for the household’s problem to be w; = a + k. The flow constraint (7) 
becomes w; = rw; + 7%; — ¢ — (7% + 7)m, — rk. The differential equation system 
(19)-(21) is unchanged as a result of this modification, and along its conver- 
gent path, 

lim wyd,e 4 = 0, Jim Wwe? = Oe 

A—s00 00 
Because negative values of the state variable are infeasible, the sufficiency 
theorem implies that the choices of consumption and real balances implied by 
the convergent path are indeed optimal choices, given the associated paths of 
the price level and real transfer payments. The convergent path is therefore a 
perfect-foresight equilibrium path. 
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The wages and employment of typographers are examined to see 
whether they can be usefully characterized as the outcome of a 
process by which the union maximizes an objective function con- 
taining wages and employment and is constrained by a trade-off 
between these two variables as represented by the employer’s labor 
demand function. Our functional form assumptions permit investi- 
gation of some familiar special cases of union behavior. The 
parameter implications of both the wage bill maximization hypothe- 
sis and the rent maximization hypothesis provide inferior expla- 
nations of the movement of wages and employment of these workers 
compared with our more general formulation. 


I. Introduction 


Although empirical studies measuring the effects of trade unions 
have multiplied in the last few years, the behavioral underpinnings of 
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unionism remain poorly understood. This is because the issues raised 
in the literature on trade union goals and wage determination have 
been mired in fruitless methodological debates or simply have not 
been pursued at an empirical level. Consequently, although a variety 
of plausible models of unionism have been proposed, an evaluation of 
them in terms of their correspondence with observed behavior would 
rest today on barely more reliable information than was available to 
Dunlop (1944) when he wrote his seminal work almost 40 years ago. 
This dearth of research is all the more remarkable when contrasted 
with the abundant theoretical and empirical work by economists on 
the behavior of firms and of families. And yet, as Farber (1978) ably 
demonstrated with his analysis of the United Mine Workers, there is 
nothing about the trade union as an institution that makes its central 
features impossible to characterize in a framework analogous to pur- 
posive models in economics.' 

The model of the trade union analyzed in this paper is by no means 
original; it is usually associated with the work of Fellner (1947) and 
Cartter (1959), among others. It characterizes the union as maximiz- 
ing a function in which wage rates and employment are arguments 
and as being constrained by a trade-off between these two variables 
represented by the employer’s labor demand function. The novel 
feature of this paper is that we have identified a setting that seems 
suitable for the operation of the model and have estimated the wage 
and employment functions that result from this characterization. 
Therefore, this represents the first empirical application of a model 
that has been widely discussed in the literature. Moreover, several 
models of trade union behavior are nested as special cases in our 
framework so that we are able to provide formal tests of some popular 
hypotheses. Our purpose is not to champion a specific model but 
simply to examine whether, in fact, the particular purposive model 
that is common in the literature provides a viable framework for 
analyzing union behavior. Only through a careful documentation of a 
number of empirical studies will the profession be in a position to 
evaluate the relevance of this model. 

In this paper we first describe some basic features of the newspaper 
industry and of the International Typographical Union (ITU), which 
is the union with which this paper is concerned. As is well known, this 
is a powerful, closed-shop union that operates in an industry whose 
technological features have changed considerably over the last 10-15 
years. This paper focuses on a period (from 1946 to 1965) before the 
drastic changes in the nature of newspaper composition, and during 


1 This opinion differs from the one widely held that, as expressed by Reder (1952), 
“the behavior of firms and consumers can be easily interpreted as ‘maximizing’ 
while that of a union cannot” (p. 34). 
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these years the model is less likely to have experienced structural 
change. We provide reasons for believing that bilateral monopoly is 
not an apt description of the market for typographers, but we cannot 
claim that our results vindicate this judgment since a bargaining 
model is not outlined and the estimates are interpreted in terms of the 
Fellner-Cartter model. Since this model seems to have escaped formal 
description, we offer such a description in Section III and also intro- 
duce our particular functional form assumptions. In Section IV we 
present our experiences with estimating the model, first with an 
excellent data set from Cincinnati and then with data on seven other 
unions. Some conclusions follow in Section V. 


II. Institutional Setting 


The organization, structure, and traditions of the ITU suggest several 
features relevant to an analytical model of union behavior. The 
structure of the ITU parallels that of the printing industry, which 
mainly consists of geographically segmented product markets. Today 
there are approximately 1,000 chapters of the union which negotiate 
individual contracts with local printing establishments, the most im- 
portant being newspaper firms. In contrast to many industrial unions, 
the membership of the ITU is remarkably homogeneous. All mem- 
bers undergo the same training, there is no observable prestige dif- 
ferentiation, and education levels and demographic characteristics 
are quite uniform.? 

Officers of the ITU on both the national and local levels appear to 
be extremely responsive to the wishes of their constituent typog- 
raphers, and the union is frequently described as the most democratic 
trade union in the United States. Two active political parties compete, 
and a rather frequent turnover of officers occurs through the electoral 
process.* The union leaders have few incentives to entrench them- 
selves in office. Financial remuneration, even at the highest level, 
differs little from Journeyman wages, and local officers are often 
workers who perform union duties in their spare time. Negotiated 
settlements are as likely to affect their own financial status as that of 


* Lipset, Trow, and Coleman (1956) write that “. . . all of the members of the ITU 
share a roughly common income and status. . . . The ITU is thus, insofar as this is 
possible, a community of equals, and there is consequently no ‘underprivileged’ group 
in the unions . . . all subskills contribute roughly equal proportions of active leaders and 
one finds no tendency for a high-status group to monopolize and dominate internal 
politics” (pp. 142-43). 

* Indeed, a survey of the largest ITU local, New York no. 6, revealed that over 
one-half of the membership held some union office at one time or another (see Lipset et 
al. 1956). Referenda are conducted on all sorts of issues, with voter participation 
reaching 60 to 70 percent even when the outcomes have marginal impacts on individual 
members. 
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the “rank and file.” The dichotomy between the interests of officers and 
of members that assumes importance in some unions appears less 
relevant to the ITU. Local contracts are constructed and negotiated as 
prescribed by union laws which have existed for decades and which 
require membership participation and ratification at each stage. 

Over the years, the central organization of the ITU has been con- 
cerned with broad issues which have relevance to the union as a 
whole.’ In contrast, issues concerning remuneration are negotiated at 
the local level, and the wage scale negotiated is applied to all jour- 
neyman printers. There is evidence that the regular employees are 
not indifferent to the welfare of the “marginal” or substitute workers.°® 
Indeed, as early as 1890, the general laws of the ITU contained 
provisions which attempted to distribute work evenly to all em- 
ployees.® More recently, the practice has been to fill all vacancies from 
substitute lists of irregularly employed workers. 

Therefore, the ITU is a union with a remarkable democratic tradi- 
tion, with no clear divergence of interests between different groups 
within the union, and with an apparent concern both for the em- 
ployment of its members and for their remuneration. A further 
attribute deserves mention: Although strikes in the newspaper print- 
ing and publishing industry as a whole are not uncommon, those 
involving the ITU are infrequent.’ Indeed, for the ITU locals whose 
data we use below, no strikes occurred during the 1946-65 period.® 
The relative unimportance of strikes is consistent with those models 
that account for strikes in terms of incomplete information either 


4 These have focused on the union’s jurisdiction over printing occupations and the 
proprietary right of members to employment. Most recently, the ITU has resisted the 
adoption of new technology which permits electronic and computerized composition of 
newspaper product except in cases where jurisdiction over new skills and maintenance 
of employment is permitted (see Kelber and Schlesinger 1967). 

5 For instance, all time accumulated in excess of the amount of hours constituting a 
regular daily or weekly shift is posted as overtime. When this overtime becomes 
equivalent to a day’s pay, the regular is required to take a day off and engage a 
substitute as long as one is available. This philosophy of “work sharing” probably 
reached a zenith during the Great Depression when a national convention adopted a 
referendum vote approving a law directing all situation holders to put a substitute on 
work at least 1 day per week. 

6 See International Typographical Union 1964, pp. 312-13. Union membership is 
sometimes viewed as a legacy to be passed from generation to generation so that family 
ties among the union members provide a further reason for the typical journeyman to 
take account of the employment effects of his union’s policies (see Lipset et al. 1956, p. 
206). 

7 For instance, in 1945 (the year prior to the period examined below) only 32 of 1,000 
locals engaged in any strikes, and all but one of the struck firms resumed publication 
within a few days. This information is from the industry’s weekly magazine, Editor and 
Publisher (1946, p. 76). 

8 Therefore, it involves not too drastic an assumption if the costs of negotiation and 
the role of strikes which sometimes figure in models of union behavior (see Mitchell 
1972; Atherton 1973) are neglected in this study. 
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within the union or between the union and the employer: Rates of 
concession are unknown and the negotiation process reveals infor- 
mation about what to expect from the opposition. However, in the 
newspaper industry, negotiations have been taking place between the 
same unions and newspaper firms for many decades, the ITU trade 
journals regularly publish wage scales for each of the 1,000 locals, and 
what is sometimes described (Reder 1952, p. 39) as a “mature” collec- 
tive bargaining relation obtains. 

The rarity of strikes reflects in large part the relative vulnerability 
of the local newspaper firm to a prolonged interruption of produc- 
tion. In particular, the firm’s product, the prompt delivery of up-to- 
date information, cannot be stockpiled to withstand a shutdown so 
that advertisers and subscribers are apt to turn to alternative media as 
a result. Once a firm closes down, it is expensive to regain former 
customers, especially advertisers, who provide an average of about 75 
percent of all newspaper revenue and who can turn to ready alterna- 
tives (television, weekly newspapers, direct mail, billboards, radio) for 
their purposes. In contrast, the ITU maintains a defense fund to aid 
striking local unions, and members have the opportunity to work 
elsewhere in the event of the newspaper firm being driven out of 
business. Hence, with the newspaper firm facing a number of im- 
mediate competitors in its product market and with the typographers 
able to draw on a sizable strike fund and to arrange for work in other 
cities, the setting is not a confrontation between two equally powerful 
adversaries: The union is clearly in a dominant position vis-a-vis the 
firm. This provides some support for interpreting the pattern of 
typographers’ wages and employment over time as tracing out the 
union’s preferences. 


III. The Conceptual Framework 


A General Statement 


From a description of the institutional setting, we turn now to a 
formal statement of the union maximizing model. The union is 
characterized as behaving as if it possesses a twice continuously differ- 
entiable, strictly concave, objective function, 


w 

U = g(t), (1) 
Pp 

where w measures the wage rate, p the price level of commodities 

consumed or the “cost of living,” and L union employment. This 

union is assumed to be involved in producing such a small part of the 

economy’s total output that any effect of its decisions on the overall 
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price level (p) may be neglected. Hence p is taken to be exogenous to 
the union, while w and L are endogenous. The union consists of many 
members so that an issue arises of whose utility function is repre- 
sented by equation (1). This has been a recurrent theme in the 
literature on trade union goals, though formally the problem is no 
different in kind from economic models of family decision making 
where the family consists of more than one individual or from the 
theory of the firm where there is more than one owner.® The typical 
ITU local has already been described as one with an unusually 
homogeneous membership and where conflicts of interest between 
the union leadership and the rank and file are not pronounced. 
Therefore, the objective function above is “the” union leader’s, and 
“he” is assumed to integrate the welfare of all the union’s members. 
The employer’s cost-minimizing labor demand function is given by 


E = f(wri7e, Oey Ti) (2) 


where 7; represents the unit price of inputi and X is the level of output 
produced with n + 1 inputs by the firm.!° As is well known, L is a 
negative function of w, L must be increasing with respect to at least 
one other input price, L increases with output (provided labor is not 
an inferior input), and this labor demand function is homogeneous of 
degree zero in all input prices. 

The union is assumed to select w and L to maximize equation (1) 
subject to the labor demand function given by equation (2). This 
yields the following first-order condition: 


meee 
ea eee? (3) 
where g, = 0U/0(w/p) > 0 and g, = OU/OL > 0: The marginal rate of 
substitution of wages for employment in the union’s objective func- 
tion equals the cost-of-living normalized slope of the employer’s labor 
demand function with respect to the wage rate.' This first-order 


® The problems involved in aggregating over preferences are discussed in Samuelson 
(1956) and Becker (1974). A rationalization of eq. (1) in terms of a “. . . union as a 
political institution run by a bureaucracy whose goals are not necessarily identical with 
those of the membership” appears in Kotowitz and Portes (1974). 

10 Our assumption with the newspaper firms below is that their output is determined 
primarily by the demographic characteristics of the local media market which are 
exogenous to a single firm. In fact, we investigate below the possibility that output 
should be treated as endogenous (see also Rosse 1970 and Dertouzos 1979). 

11 As is well known, this tangency solution between the union’s indifference curve and 
the slope of the firm’s labor demand function does not lie on the contract curve (see, 
e.g., Leontief 1946 or Fellner 1947). This feature may be unrealistic in some 
employer-union settings, but it is not clear that it is so for typographers. The second- 
order condition concerns the relative rates at which the union’s isoquant and the 
employer’s labor demand function are changing: p gu, + 2p 'g:2(0f/dw) + g2(0*f/0w") 
+ g(df/dw)? < 0. 
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condition together with the employment demand function (2) may be 
solved to derive the reduced-form equations of the system: 


Wi b( p17 1,72 orien 1n,X) (4) 
and 
LAUD) oe hee) (5) 


Since an increase in the cost of living reduces the real value of the 
wage rate without disturbing the employer’s structural labor demand 
function (2), the effect of an increase in p on the wage rate will be 
Opposite in sign to its effect on employment. Other than this, without 
further restrictions on the form of the union’s objective function and 
the labor demand function, these wage and employment equations 
are devoid of any unambiguous sign implications.!2 This dearth of 
qualitative predictions seems to have escaped notice in the extensive 
literature on this simple maximizing model. 

If the reduced-form equations (4) and (5) are substituted back into 
the union’s objective function (1), an expression is derived that relates 
the maximum value (call it V) of the union’s objective function to the 
exogenous variables of the system: 


VA Dr iiss eee bie Xe (6) 


Through the envelope theorem it is straightforward to show that V 
decreases with respect to p and with respect to the price of any input 
that is complementary with labor and that V increases with respect to 
output and with respect to the price of any input that is substitutable 
for union labor. Moreover, the trade-off in the union’s optimizing 
objective function (6) between increases in the prices of any two inputs 
is simply represented by the ratio of the derivatives of the employer’s 
labor demand function with respect to each of the input prices:!% 
(OV /dr;)/(OV/dr;) = (Of/Or;)/(Of/Or;). Since these derivatives of the 
labor demand function are related to the question of the degree to 
which this labor is substitutable for (or complementary with) other 
inputs, this illustrates the relevance of the production technology in 
constraining the union’s opportunities. 


The sort of restrictions on eqq. (1) and (2) that generate refutable sign implications 
may be illustrated as follows. If eq. (1) is strongly separable in wages and employment 
(or, less restrictively, if £12 = 0), if labor is not an inferior factor, and if an increase in 
output does not reduce the slope of the labor demand function (i.e., if 07f/8wAX = 0), 
then an increase in output or an increase in the price of a substitute input results in a 
higher optimal money wage rate. 

*® Also (V/0r;)/(AV/AX) = (Af/dr,)/(Af/X). 
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Particular Functional Forms 


In order to implement this framework, at some stage in the analysis 
some explicit functional forms have to be introduced. These functions 
should be consistent with the institutional features of the newspaper 
industry, they should be straightforward to interpret, and they should 
be computationally manageable. The approach in this paper is to 
specify particular forms for the union’s objective function (1) and for 
the firm’s employment demand function (2) and then to use the 
first-order condition (3) to solve for the corresponding reduced-form 
system (4) and (5). Thus the union’s objective function defined over 
real wages and employment is specified as follows: 


ik Wee Oe aa Vo eaeaed RS, 
uF LL) e y] ete, (7) 
This function, the Stone-Geary, claims an excellent pedigree in em- 
pirical work on estimating systems of consumer demand equations, 
though this represents its first application to union behavior. In fact, it 
possesses a number of attractive features for our purposes. Though it 
is not necessary to adhere to this interpretation, following Samuelson 
(1948), y and 6 may be thought of as “reference” or “minimum” or 
“necessary” values with the function being homothetic to the point 
(y,5), and (w/p) — yand L — 6 represent “supernumerary” real wages 
and employment, respectively. The value of @ indicates the relative 
importance attached to supernumerary wages versus supernumerary 
employment; only when y = 6 = 0 may @ be interpreted as measuring 
the relative weight of wages and employment in the union’s objective 
function. 

One appealing feature of equation (7) in this context is that it nests 
other models of union behavior as special cases. For instance, Dun- 
lop’s model of “maximization of the wage bill for the total member- 
ship” (1944, p. 44) is equivalent for the closed-shop ITU to specifying 
6 = .5 with y = 6 = O in the objective function.'* Or the rent 
maximization hypothesis (Rosen 1970; de Menil 1971) implies 0 = .5, 
5 = 0, and y equal to the competitive wage. Dynamic elements may be 
introduced by relating y and 6 to lagged values of real wages and 
employment so that the union’s objectives are those of wage growth 
and employment growth.’ Where a union designs its wage policy with 


14 Clearly, with y and 6 equal to zero, a value of 6 of 1 yields an objective function that 
consists simply of the wage rate (a model of union goals sometimes attributed to Simons 
[1944]), while a value of 6 of 0 implies that only employment matters. 

15 Cf. “The bargaining process is almost always directed toward an upward or down- 
ward change in a pre-existing wage rate and the greatest interest often centers on the 
magnitude of the change” (Ross 1948, p. 46). Formally, this particular extension of the 
Stone-Geary was proposed and analyzed by Stone (1966) and Pollak (1970). 
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reference to what another union has attained or toa nonunion wage, 
then y assumes the value of this comparison wage fate.’ Hence the 
Stone-Geary function can accommodate a number of alternative 
hypotheses that have received attention in the literature on union 
objectives. 

The union’s opportunity set is defined by the newspaper firm’s 
labor demand function, the expression for which is as follows: 


L = a+ a,(2) - a| =) + a3X + aD, (8) 
1 ; 
where, as before, the 7’s indicate the prices of nonlabor inputs and X is 
the firm’s output. The negatively sloped labor demand function re- 
quires a, < 0 and, if labor is not an inferior output, then a3; > 0. 
During our estimating period (1946-65), there was considerable 
merger activity in the newspaper industry;!7 so, to allow for such 
mergers causing a shift in the labor demand function, equation (8) 
includes a dichotomous variable, D, that takes the value of unity when 
a newspaper merges with another. This parameterization is consistent 
with a merger resulting in an entirely different cost function for the 
newspaper, different in its intercept and different in its slope with 
respect to each of its arguments. This linear-in-the-parameters labor 
demand function, equation (8), generates tractable estimating 
equations and yet conforms to what is known about the production 
technology in the newspaper industry."* 

The first-order condition for a maximum of (7) subject to equation 
(8) is 0/(1 — 0) = —[(w — py)ay]/((L — )r,], which may be solved for 
the reduced-form wage equation expressed in real terms as 


2 =p + e(2) + 4(2)+8(—")+e() © 


p P p Pp 
where Bo = (1 — 0), By = Oa7'(5 — a), Bs = —Ba,az", Bs = —Oaz07", 
and B, = —0a,a,'. Correspondingly, the reduced-form employment 
equation is 
L = Bs + Bo(L) + B,(=) + BaX + BoD. (10) 
1 1 


16 The importance of such comparisons in affecting the goals of trade unions has 
been stressed by Ross (1948). Again, formally it involves a similar sort of extension of 
the Stone-Geary function as that analyzed by Pollak (1976). 

17 Almost 8 percent of all cities with daily newspapers had more than one newspaper 
in 1948. By 1968 this had fallen to less than 3 percent (see Dertouzos 1979). The effect 
of a newspaper merger on union preferences was also investigated, and the results are 
reported below. 

18 More information on the newspaper production technology is contained in Rosse 
(1970, 1977). Also estimated with the Cincinnati data were labor demand functions 
whose slopes and intercept changed with a merger, but the results did not differ 
materially from those reported below. 
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where B; = a + 0(5 — a), Bg = ayy(1 — 6), By = (1 — @)az, Bg = (1 — 
9)a3, and By = (1 — @)a4. Observe that, since 0 < 0 < 1, a3 > 0, and a; 
< 0, these equations imply a positive correlation between wages and 
employment, on the one hand, and firm size, on the other hand." 
These particular functional form assumptions yield reduced-form 
employment and real wage equations that are homogeneous of de- 
gree zero in p,7;, and ry. By substituting equations (9) and (10) into the 
union’s objective function (7), we derive an expression that relates the 
maximum value of this function to the exogenous variables of the 
system: 


V (p.r1,72,X ,D) = 


girl = oy? 

= or ae Layyp + (Qo — 9)ry + Gere + agXr, + a,Dri], (11) 
ap 14 

which 1s also, of course, homogeneous of degree zero in p, 7,, and 72. 
With the equations of the system described, we turn now to the 

estimation of the model, first with a data set on the ITU in Cincinnati 

and then with other union locals. 


IV. Empirical Results 
Estimating Methods 


The structural parameters of our system were estimated by two pro- 
cedures. The first exploits the fact that the cost of living, p, enters the 
system through the union’s objective function (7) but is excluded from 
the firm’s structural demand equation (8). Thus, using p as an instru- 
mental variable to predict values for (w/r,), a conventional instru- 
mental variable procedure was applied to equation (8) which provides 
estimates of the a’s. These a’s thus estimated were then imposed in 
estimating the reduced-form wage equation (9) which yields point 
estimates of the parameters of the union’s objective function (namely, 
6, y, and 6). The second estimating procedure involves the simultane- 
ous estimation of the stochastic form of equations (8) and (9) by full 
information maximum likelihood.?® Conditional on the specification 
of the model being correct, this estimator is consistent, asymptotically 
efficient, and asymptotically normally distributed. In fact, the point 
estimates derived from applying the two procedures were invariably 
similar, and consequently only the maximum-likelihood estimates are 


reported. 


19 For evidence on the association between wages and firm size in the newspaper 


industry, see Dertouzos (1979). 
20 In particular, we employed the Berndt, Hall, Hall, and Hausman (1974) computa- 


tional procedure. 
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The Cincinnati Local 


These procedures were first applied to an excellent data set collected 
for a newspaper, the Cincinnati Post, for the years from 1946 to 1965.7! 
In estimating the stochastic form of equations (8) and (9), w¢ is 
measured by the hourly wage scale for journeyman printers em- 
ployed by the Post in year t, L; by the number of full-time typographi- 
cal workers in the Post composing room, p; by the BLS consumer price 
index, X; by the amount of advertising linage (in thousands) sold 
annually by the Post, r, by the price of newsprint (in hundreds of 
dollars per short ton) quoted in national markets, and r, by the BLS 
wholesale price index for machinery and equipment.” In 1958, the 
Post merged with one of its two local competitors, the Cincinnati 
Times-Star, and, as a result, a larger and more efficient plant was 
occupied. The ITU members of the Times-Star’s chapel were absorbed 
into the Post composing room so that the employment of typog- 
raphers rose from 145 averaged over the years 1946-57 to 232 
averaged over the years 1948-65. Thus the merger dummy variable, 
D, in equations (8) and (9) takes the value of zero from 1946 to 1957 
and of unity for the later years. The full information maximum- 
likelihood estimates of the structural parameters of equations (8) and 
(9) are given in row a of table 1. 

With respect to the parameters of the union’s objective function, 
the estimate of 6 is significantly less than .5 and greater than 0; the 
union values both employment and wages. The estimate of y is 
significantly greater than zero, suggesting that it is not merely the 
level of wages that matters to the union but, rather, the excess of 
wages over some reference level. In fact, the estimate of y of around 
2.1 ranges from 26 to 47 percent below the real wage actually received 
by these workers over these years. In contrast, the point estimate of 5 
is negative though insignificantly different from zero. According to 
the estimates in row 4@, the elasticity of the maximum value of the 
union’s objective function with respect to wages (.55) is slightly greater 
than the elasticity with respect to employment (.45) when evaluated at 
the sample mean values of the variables. The implied elasticity of 
substitution of wages for employment at this point is .69. Using a 
conventional likelihood ratio rest, the null hypothesis that the non- 
wage arguments of the labor demand function (namely, 7,, 72, X, and 
D) are jointly insignificant in the real wage equation (9) may be 
rejected at the 5 percent level of significance. These results are con- 


*1 In the course of performing consulting activities for the Cincinnati Post, one of us 
obtained employment data normally not available through public sources. 

* In some estimates, the machinery price index was replaced by a measure of the 
user cost of capital, but the results were essentially unchanged. 
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sistent with the notion that the union takes account of the employ- 
ment effects of its wage policy. 

As for the parameters of the labor demand function, the implied 
wage elasticity of employment ranges from about —1.8 to —1.0 with 
an elasticity of —1.23 at the mean values of the variables.?* The 
elasticity of employment with respect to output is estimated at 38 at 
the mean values of the variables.24 The estimates imply that newsprint 
and machinery are each substitutes for labor for all observed values of 
the variables. At sample mean values, the elasticity of the demand for 
labor with respect to the price of newsprint is estimated to be .77 and 
that with respect to the price of machinery to be .46.2> The estimated 
coefficient on the merger dummy (@,4 = 87.9) is virtually identical to 
the difference between the average number of typographers em- 
ployed after the merger and before the merger (namely, S 7a 

At this point, our estimates were contrasted with those. corre- 
sponding to two popular hypotheses of the objectives of unions. First, 
under the wage bill maximization hypothesis, 6 = .5 and y = 6 = 0, 
and the estimates of the labor demand function fitted under these 
constraints are given in row 6 of table 1. A conventional likelihood 
ratio test finds that the unconstrained estimates in row a represent a 
significant improvement (at the 5 percent level) over those fitted 
under the wage bill maximization hypothesis in row 6. Second, a test 
of the rent maximization hypothesis requires an interpretation of y as 
the wage rate that typographers would earn if they were not or- 


3 It is not straightforward to compare these estimates of labor demand elasticities 
with others in the literature since these relate to a small group of workers in a single 
firm, while previous estimates have been at a much higher level of aggregation. 

*4The assumption that output is exogenous in these equations was considered. For 
instance, errors in measuring output may arise because our index of newspaper output, 
namely, advertising linage, ignores dimensions of output such as quality and the space 
devoted to news. For this reason, some instrumental variable estimates were derived 
where the instruments for newspaper output consisted of the city-zone household 
population per year, aggregate U.S. advertising expenditures per household in real 
dollars, and the share of aggregate advertising expenditures earned by television. 
Clearly, these variables are exogenous to the individual newspaper firm. The resulting 
estimates of the parameters of the labor demand function were very similar to those in 
row a of table 1. 

» This finding that newsprint and typographers are substitute inputs is consistent 
with the stylized facts in the newspaper industry. For as the relative price of newsprint 
rose, firms found it optimal to conserve on newsprint by producing zoned editions of 
newspapers, and typographers were required to compose advertising copy for each 
edition’s separate sections. 

*® Augmenting the labor demand function with a linear time trend variable produces 
similar results to those reported in row a (table 1), while the estimated coefficient on the 
trend variable is about one-quarter its estimated standard error. To be specific, the 
maximum-likelihood estimates of this system are as follows (with estimated SEs in 
parentheses): 6 = .201 (.053), y = 2.126 (.880), 8 = — 130.02 (283.58), a» = 196.4 (91.8), 
a, = —80.99 (27.59), a. = 1.087 (1.451), a3 = 3.315 (2.653), a, = 85.14 (15.89), and 
with the estimated coefficient on the time trend variable .290 (1.302). 
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ganized in a trade union. This alternative, nonunion, wage rate is not 
observed, of course, but a plausible specification is that this alterna- 
tive wage is a linear function of what some existing nonunion workers 
receive. In particular, suppose that, in the absence of the ITU, these 
workers would earn a linear function of the real average hourly earn- 
ings of nonsupervisory workers in the retail trade: y = yo + y,(v/p).27 
With this specification for y, the rent maximization hypothesis implies 
the restrictions 6 = .5 and 6 = 0. The estimates corresponding to this 
formulation are presented in row c of table 1 with the unconstrained 
estimates in row d. A comparison of the values of the likelihood 
function in rows and d results in a rejection (at the 5 percent level) of 
the null hypothesis that the unconstrained estimates are no different 
from those implied by the rent maximization hypothesis. Therefore, 
the objectives of the ITU at the Cincinnati Post are not adequately 
described by the maximization of the wage bill or by the maximization 
of rents. 

The estimates in row d of table 1 correspond to a specification in 
which y is a linear function of an alternative real wage rate, while 6 is a 
constant. A natural generalization is to permit 6 to vary, and, on 
investigation, it was found that 6 appeared to take on a higher value 
after the 1958 merger which resulted in the ITU members of the 
Times-Star chapel being absorbed into the Post’s composing room. The 
final two rows of table 1 augment the specification of y = yo + y,(u/p) 
with the specification of 6 = 6) + 6,D in rowe. This more general for- 
mulation yields the highest point estimate of 6 (namely, 490) that we 
encountered for Cincinnati. However, it is not clear what importance 
should be attached to this because the implied estimates of y and 6 are 
greater than the corresponding mean values of real wages and em- 
ployment, respectively, so the objective function is not defined at 
these values. This particular outcome probably results from too re- 
strictive a specification of the movements in y and 6, but, unfortu- 
nately, we are handicapped in our investigation of this matter by 
insufficient degrees of freedom provided by this data set. In general, 
of all the coefficients estimated, we found the point estimates of the 
employment parameter 6 to be the most sensitive to slight alterations 
in the specification of the model. 

Estimates that are more consonant with the Stone-Geary specifica- 


27 To forestall any confusion, the assumption here is not that, if they were not 
unionized, these typographers would be workers in retail trade. The assumption is that 
if the typographers were not unionized and remained typographers, their wage over 
time would be some proportion of what nonunion workers in retail trade receive. The 
v series used relates to the United States as a whole. An hourly earnings series for 
nonsupervisory workers in the retail trade in Ohio is available only after 1952 and in 
Cincinnati after 1957. However, for the years from 1953 to 1966, the Ohio series and 
the U.S. series move very closely together. 
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tion are given in row f of table 1, but they are obtained only after 
arbitrarily restricting the parameter 6, to a value of zero. According to 
the results in row f, the elasticity of substitution between real wages 
and employment in the union’s objective function is .305 at sample 
mean values of the variables. The elasticity of the maximum value of 
the union’s objective function with respect to wages (.97) is virtually 
identical to the elasticity with respect to employment (.98). The esti- 
mate of y, is consistent with the hypothesis that the ITU’s reference 
wage, y, rises dollar for dollar with increases in the wages of non- 
union workers in retail trade. The merger raises 6 by almost exactly the 
amount by which the merger shifts the labor demand function (as 
given by a4). At sample mean values, the elasticity of employment with 
respect to wages is —1.00, that with respect to output is .53, that with 
respect to the price of newsprint is .55, and that with respect to the 
price of machinery is .44. 

Finally, to illustrate the use of the union’s indirect objective func- 
tion equation (11), we formed the index V/Vo, where V9 and V; 
indicate the values of the indirect objective function in the year 1946 
and in each subsequent year ¢t, respectively. This ratio, V/V», may be 
interpreted as an index number of the union’s objective function 
given the observed values of p, 7,72, and X in year t relative to its value 
in the base year 1946. Calculating this welfare index with the 
parameter estimates from row f of table 1 yields the following results: 
for 1946, V,/V) = 1.00; for 1950, 1.46; for 1954, 1.50; for 1958, 1.57; 
for 1962, 1.55; and for 1965, 1.76. This registers a 76 percent in- 
crease in the union’s welfare over this period. In contrast, the real 
wages of these typographers rose by 41 percent between 1946 and 
1965 while employment grew 118 percent. 


Other Union Locals 


To examine the generality of this framework of union behavior, data 
for the 1946-65 period were collected for seven other local chapters 
of the ITU. Unlike the data for the Cincinnati Post, observations on 
firm employment are not readily available so that for these seven 
locals we used data on local union membership instead.?8 The result- 
ing seven union locals range in size from Fond du Lac, Wisconsin, 


*8 Data on local union membership may be obtained from the union’s publication 
Typographical Journal. These union membership data include those employed in com- 
mercial printing establishments as well as those in newspaper firms. We were obliged to 
use the union membership data; though, to minimize measurement problems, large 
cities with substantial printing industries (such as New York and Chicago) were ex- 
cluded from our analysis. For most union locals, commercial printing employment 
constitutes a very small fraction of the membership. 
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with a mean journeyman membership of 41 over the 1946-65 period, 
to the Columbus union with average membership of over 600. In each 
city but one, the wage and advertising linage variables relate to the 
single newspaper firm in the city.?® The stochastic forms of equations 
(8) and (9) were fitted to the time-series data for each of the seven 
unions with estimates given in table 2. 

As far as the union’s objective function parameters are concerned, 
first observe that the estimates of y are greater than the mean values 
of real wages for Elmira and Fond du Lac and the estimates of 6 
exceed the mean values of employment for Augusta, Elmira, and 
Fond du Lac. As was the case for the estimates for Cincinnati in row e 
of table 1, this is a consequence (we surmise) of not accounting 
adequately for movements in the reference levels, y and 6.*° For the 
other four unions, the estimates of y are significantly greater than 
zero in all cases, while for Columbus only would the estimate of 6 be 
evaluated as greater than zero at conventional levels of significance. 
The point estimates of 6 range from .15 in Dubuque to .79 in Colum- 
bia. Except for Dubuque, the estimates of 6 are significantly greater 
than zero and smaller than unity.*! Comparing the results for Colum- 
bia, Columbus, Dubuque, and Memphis, the tendency is for the 
estimates of the “reference” parameters, yy and 6, to be greater for the 
larger unions. These larger unions are located in bigger cities where 
opportunities for alternative employment (especially in commercial 
printing) are greater and where wage levels are higher. In particular, 
the significance of y suggests that the union’s wage policy is deter- 
mined with reference to some alternative or comparison wage rate, 
and models that ignore this aspect of union behavior are neglecting 
what appears to be a consistent finding in this study. In this regard, 
using a likelihood ratio test, we are able to reject (at even the 1 percent 
level) for all of the unions the hypothesis that the unconstrained 
estimates are no different from those implied by the wage bill 
maximization hypothesis (under which 6 = .5 and y = 6 = 0). 

The estimates of the parameters of the labor demand function are 
probably best understood in terms of their implications for the elas- 
ticity of demand for labor which are given in the final three columns 


28 Columbus is the single exception: In this city the data are aggregated over several 
newspaper firms. We follow Rosse (1978) in treating agency shop newspapers (Le., 
“enterprises with separately owned and produced editorial products but with joint 
mechanical production and joint sale of advertising and subscriptions” in the same city 
[p. 3]) as a single firm. : es 

© On occasions, permitting the reference levels y and 6 to vary resulted in a likeli- 
hood function with no pronounced maximum. 

31 Fyaluated at each union’s sample mean values, the elasticity of substitution between 
wages and employment in the union’s objective function is estimated at .70 for Colum- 
bia, .04 for Columbus, .49 for Dubuque, and .20 for Memphis. 
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of table 2. The wage elasticities evaluated at the sample mean values 
of the variables for each union (so the point of evaluation differs 
across unions) range from —5.59 for Elmira to —.25 for Dubuque, 
though the standard errors attached to these estimates caution against 
confident inferences: Clearly, a wide range of elasticities is compatible 
with the data. 


V. Conclusions 


The purpose of this paper is to report the consequences of an empiri- 
cal analysis of a well-known model of union behavior. In general, our 
results indicate that the ITU’s objectives cannot be identified with 
wage rates or with employment alone, but that both are arguments of 
a more general objective function. Our estimates accord with those 
models in which the union is characterized as setting its decision 
variables with reference to some norm, since it appears to be the excess 
of wages and employment over some reference levels that is relevant 
to union behavior. However, the relative weight attached to super- 
numerary wages versus supernumerary employment varies consider- 
ably across unions. The wage bill maximization hypothesis that ig- 
nores these reference levels has been found in every case to provide 
an inferior explanation of movements in wages and employment 
compared with our more general formulation. Nevertheless, in some 
cases, our more general specification is insufficiently general, but the 
opportunity to relax the restrictions on the reference level parame- 
ters, y and 64, is severely limited by the available number of obser- 
vations on each union. It should also be emphasized that an evalua- 
tion of the empirical performance of this model tacitly involves joint 
tests on the specification of the union’s objective function and of the 
labor demand function, since expressions are required for both func- 
tions in order that a model of this kind may be implemented. 
There are two particularly important areas of union behavior 
where the model described in this paper is inadequate. The first 
concerns the various job security provisions in many union contracts 
which, if effective and binding, should be modeled explicitly. Of 
course, though they exist in many collective-bargaining contracts, the 
extent to which the employer can evade their constraints has not been 
determined in other than a few instances.®” The second area that 


32 One instance is Hartman’s (1969) excellent study of the effects of the mechaniza- 
tion and modernization agreement in the Pacific Coast longshoring industry. His 
analysis suggests that those work rules which took the form of specifying fixed quan- 
tities of labor input were essentially evaded by the employers by an appropriate 
arrangement of the flow of work. On the other hand, those work rules that governed 
the mix of machinery to labor did appear to affect employment and productivity. 
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requires more research concerns the intertemporal aspect of the 
model: Clearly, the objectives of unions are defined over the future 
values of variables as well as their current values, while the employer’s 
immediate response to a negotiated settlement may well differ from 
his longer-term response. Any serious attempt to model trade union 
behavior must eventually come to grips with these issues. What has 
been described in this paper is only an initial step toward that more 
satisfactory model. 
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The paper provides a theoretical description of hedging in futures 
markets that accounts for the behavior of a broad class of agents. 
Specific optimal decision rules are derived for agents concerned with 
the mean and variance of profit. These rules are used to evaluate 
how optimal cash and futures positions are related to price ex- 
pectations, the production possibilities, and the number of futures 
markets available. 


I. Introduction 


Futures markets in commodities, foreign exchange, and financial 
assets are thriving with increased participation by a broad spectrum of 
hedgers and speculators. Despite this fact, there is no satisfactory 
systematic account of the optimal use of futures markets by hedgers 
and of the way real output decisions are affected by the availability of 
futures markets. Conventional descriptions (such as that of 
Hieronymus [1971]) seem to propose a view of hedging as taking a 
position in a futures market that is equal in size and opposite to a 
predetermined position in the cash market. Such a simple view does 
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not square well with the observed diversity and sophistication of 
futures markets participants. This paper is an attempt to provide a 
theoretical description of hedging that could better account for the 
behavior of a broad class of hedgers. The following facts and results 
are underlined: (1) Optimal cash and futures positions are normally 
determined simultaneously. The availability of hedging opportunities 
affects production or trading decisions. (2) The optimal hedge can be 
viewed as the sum of a risk-minimizing position and a speculative 
position. (3) The traditional “routine hedge” (which prescribes the 
futures position to be the exact opposite of the cash position) in 
general is suboptimal in that it cannot be identified as a risk- 
minimizing position or as an optimal speculative deviation from the 
minimum-risk position. (4) In the absence of transaction costs and of a 
“perfect hedge,” hedgers will take advantage of the better risk-return 
performance associated with simultaneous trades in a multiplicity of 
futures contracts. (5) When a limited number of contracts seems 
appropriate (as a result of transaction costs or indivisibilities), the 
question of which contract or combination of contracts gives best 
hedging results arises. This can be answered by making use of avail- 
able statistical information. (6) When dealing in a good for which no 
futures exists, a “cross hedge” may be appropriate; the best cross 
hedge may be calculated in exactly the same way as a standard hedge. 
(7) The allocation signal to be used by a hedger for his production 
decision is a linear combination of his expectations and the relevant 
futures prices. 

More specifically, we derive operational formulae for individuals 
concerned with the mean and variance of their profit. We acknowl- 
edge the fact that cash and futures contracts are rarely perfect sub- 
stitutes, or that the relation of the cash price to the relevant futures 
price at the time of delivery may not be predicted with certainty (this 
is known as basis risk); and we recognize that the rarity of a perfect 
hedge may justify a portfolio approach to hedging, whereby risk 
reduction (and speculation) is achieved through dealing in multiple 
contracts. We derive criteria for choosing the best contracts with 
which to hedge and evaluate the benefits associated with the existence 
of hedging opportunities. 

Previous literature had raised some of the deficiencies of the routine 
hedge prescription. Thus Working (1962) emphasized that the exis- 
tence of basis risk prevents the elimination of all risk and was led by 
his observation of hedging practices to the notions of “anticipatory” 
and “selective” hedging (where deviations from routine hedge are 
based on price level expectations). Telser (1955), Johnson (1960), 
Stein (1961), and Peck (1975) have analytical models of optimal 
hedging that share some of the characteristics of ours. Johnson and 
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Stein derive optimal hedges for mean-variance utility maximizers, 
recognizing the simultaneity of cash and futures decisions. Telser 
does the same for hedgers following a safety-first rule of decision. 
However, none of these contributions explicitly treats cross hedging, 
none admits of multiple futures markets, none specifically studies the 
impact of hedging on real decisions, none studies the conditions that 
permit the separation of real and futures decisions, none specifically 
recognizes that a hedger’s position contains a component that is 
equivalent to a pure speculation, and none discusses and analyzes the 
pricing role of futures markets.! 

In Anderson and Danthine (1978), we use a general expected- 
utility framework to describe the simultaneous production and fu- 
tures decisions of a variety of hedgers in the face of price and quantity 
uncertainty. However, there we were more concerned with describing 
market equilibrium and did not allow for basis risk or multiple cross 
hedging. Our use here of a mean-variance utility formulation is dic- 
tated by our desire to obtain the operational results which we believe 
are a strength of the paper. This framework is equivalent to 
expected-utility maximization if net revenues are distributed nor- 
mally and agents’ utility functions are exponential. In our opinion 
the framework can be justified as well on the grounds that expected 
returns and (some measure of) risk are the interesting practical crite- 
ria for deciding hedging strategies. Similar results could have been 
obtained less elegantly by allowing a general expected utility formula- 
tion, expanding first-order conditions by a Taylor series, and arguing 
reasonably that the first and second moments of the distribution 
dominate the higher-order terms and are good proxies to be used for 
real world decision making. 


II. The Basic Model of a Futures Market Participant 


The set of futures markets participants is far more diverse than the 
traditional dichotomy of hedgers and speculators suggests. However, 


‘In a study which came to our attention after the present paper was written, Rolfo 
(1980) applies a model similar to ours to the world cocoa market. Focusing on the 
futures position of a short hedger, he derives the analogue of our eq. (5) for a case of 
production uncertainty. In that we choose to generalize the model in other directions, 
Rolfo’s contribution complements ours in an empirically important way. However, the 
similarity of our (5) with his eq. (1) demonstrates that, analytically, production uncer- 
tainty does not introduce an essentially new dimension and that we can ignore it in our 
attempt for an otherwise more general analysis. We believe Rolfo is not correct in 
maintaining that production uncertainty is the sole or necessarily the main reason for 
the optimal hedge to deviate from the routine hedge (see his criticism of the quote of 
Hieronymus [1971]). In Sec. III below, we see the same deviation in our simple hedging 
model without production uncertainty, including the possibility (noted by Rolfo) of the 
optimal hedge being of the same sign as the cash position. 


CROSS HEDGING 1185, 


a single, simple model suffices to characterize the principal features of 
optimal decisions by these agents. In our framework a decision maker 
commits himself in period 0 to an amount, y, to be sold (or, as will be 
seen, bought) in period 1 at the then-prevailing price p,. By making 
this commitment the agent incurs a cost (generates income) which 
depends on the quantity according to the nonstochastic cost (revenue) 
function c(-); c(-) represents the period 1 value of costs that may be 
incurred in either period. This describes the real production decision 
or, in the language of futures markets, the cash position of the agent. 

The agent may also deal in futures. We assume the existence and 
availability of n different futures contracts differentiated by delivery 
month; delivery location; and type, quality, and amount of the good 
delivered, but all being traded at the two dates relevant to the agent’s 
plan. For the 7th market let; be the amount of the futures sold at time 
0 (f; < 0 represents a purchase). This position is closed out by an 
offsetting trade at time 1. Denote the price of futures 7 at time ¢ as pj. 
Let f and p{ be the column vectors of the n futures positions and 
prices. Explicit consideration of the margins necessary for dealing in 
futures markets is omitted, but the cost of these margins may be 
implicitly reflected in p>. In period 0, period 1 prices are unknown 
and are viewed as random variables p , and pi whose joint distribu- 
tion, as perceived by the agent, has the following moments: 


Ep,=p1, Epi=p1, 
var p1=Vo0, var phr=Ci, 
cov piph=cw cov piip1= Gio. 


Let the (n + 1) X (n + 1) variance/covariance matrix & be partitioned 


as 
ae i a 
Yi |S 


where 27, 1s 7 < 1,29, = 249 1s 1 Xn. 
As of period 0, the agent’s period 1 net revenue is the random 
variable, 
a= py —c) — (bi - POS. (1) 
The net revenue relation (1) may be interpreted to represent that of 
any of a large number of futures participants. Specifically, the storage 
operation, the traditional example of a short hedger, 1s accommo- 
dated if we define c(y) as the cash value of y (y > 0) in period 0 plus the 
total costs of storing y. For the producer of the cash good, for exam- 


ple, a farmer, the cost function is a composite of all real production 
costs. We can also model the decision of a long hedger, for example, 
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that of a processor who sells a forward merchandising agreement for 
the processed good. In that case the agent commits himself to a 
purchase (y < 0) of the cash good which will be processed and sold 
with a known price yielding a revenue (c[y] < 0). Finally, a speculator 
has no cash position (y = 0, c[0] = 0). In view of these remarks we 
assume ¢(0) = 0,c'(-) > 0, c"(-) = 0. The decision maker selects y andf 
so as to maximize a linear function of the mean and variance of net 
revenue. Thus the agent’s problem is 


max Ew — ex var 7, (2) 
uf 


where x is a positive parameter reflecting the agent’s risk aversion and 
m@ is given in (1).2 The necessary conditions for a maximum are 


fi-¢'(y) — X(VO — Loaf) = 0 (3) 
and 
(po — pi) — X(2uf — y2i0) = 0. (4) 


These are sufficient given c” > 0, and > nonnegative definite as the 
maximand of (2) is then concave in Ore 


III. Optimal Futures Positions 


Given y, the futures vector satisfying (4) is unique if and only if &,, is 
nonsingular. In this case the optimal futures position is 


jos = iP 0 PA Oat io: (5) 


where y is the optimal cash position. In the present subsection we 
interpret this expression in the light of some recognized patterns of 
hedging and speculation. 

First, note that fora speculator y = 0, so that his optimal position is 


yet le eee Ts 
x Ualhe p'): (6) 


A speculator’s optimal position in any given contract depends on 
the complete vector of expected returns, the covariance matrix, and 
parameter x. Assuming they agree on the distribution of futures 
prices, all speculators purchase the same “mutual fund” of futures, 
2i(p> — pi). The amount held of this speculative mutual fund is the 
inverse of the agent’s degree of risk aversion. As we would expect 


* We have placed no constraints on the range of y and f; in particular, there is no 
wealth constraint. This implies the availability of financing for any sized futures posi- 
tion at constant rates reflected in the price p%. 
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from basic portfolio theory, it is only in special cases that knowledge 
of the sign of pf, — pf, is sufficient to determine the sign of f;. For 
example, if pj is uncorrelated with the remaining futures or if it is the 
only available future (n = 1), the speculator goes short (long) when 
the price of 7 is expected to fall (rise). 

Having noted the position of a pure speculator, we see that the 
futures position of any hedger (y 4 0) can be decomposed into two 
parts. The first part, y27'210, may be viewed as a pure hedge, and 
the other may be viewed as a pure speculation since it is formally 
equivalent to the speculator’s position. The pure hedge component 
may also be viewed as participation in a “hedger’s mutual fund.” This 
fund, &7{210, is the same for all hedgers who hold the same views of 
the second moments of the price distribution; the degree of partici- 
pation is determined by the size of the agent’s cash position. 

Further interpretation of the pure hedge component may be 
facilitated if we first consider the case of a single cross hedge where it 
equals yoo;/0;,. Here we observe that the sign of the pure hedge 
position depends on the sign of y and that of oo;. In most common 
hedging activities, the futures is specified in the same basic good as the 
cash good (albeit of a possibly different grade and at a different 
location) so that d), > 0. Then for those who are committed to a 
future cash sale the pure hedge position is short, and for those 
committed to future cash purchases the pure hedge position is long. 
These particular cases conform to the usual designations of farmers 
and storage companies as short hedgers and of processors of a futures 
traded commodity as long hedgers. 

The pure hedge stated as a proportion of the cash position is given 
by Los lope | = |p | V do9/01;, where p is the simple correlation be- 
tween cash and futures prices. From this we can see that the pure 
hedge is the “routine hedge” of taking a futures position equal in size 
to the cash position only if this proportion is unity. If cash and futures 
prices have comparable variance but, as is widely expressed in the 
industry, are imperfectly correlated, the pure hedge position is small- 
er than the cash position.* In most familiar examples of hedging, the 
same basic commodities are traded in the cash and futures markets, 
and price differences are attributable to grade, location, and timing 
differences. However, many cash goods do not have obvious futures. 
In these cases a “cross hedge” may be attempted by taking position in 
a futures for a related commodity. Equation (4) holds in this case also. 
Note that cross hedges are in order whenever the cash/futures corre- 


3 The common ironic remark that farmers are the biggest speculators because they 
have crops in the ground without a matching short position in futures may or may not 
be correct. Our framework shows that underhedging may result from relatively low 
correlation and/or high variance of futures, or from an expected futures price increase. 
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lation is a constant different from zero. Thus, for example, one might 
attempt to hedge an inventory of cash good by purchasing futures for a 
good which is a complement in the sense that oo, < 0. 

The generalization to multiple futures contracts says that the pro- 
portion of output that should be hedged in each contract is given by 
the coefficient of the theoretical multiple regression of cash prices on 
the n futures prices (27}24 is the vector of these multiple regression 
coefficients). This result suggests that the partial (given remaining 
futures prices) correlation coefficient between the cash price and a 
futures contract is a better indicator of the usefulness of that contract 
for hedging purposes than is the simple correlation coefficient. Thus 
it is conceivable that the optimal hedge against a long cash position 
involves a long futures position in a contract even when the (simple) 
correlation between the cash and that futures price is positive. 

Finally, it is worthwhile pointing out that the pure hedge term, 
y241%10, is equal to the risk-minimizing futures position corresponding 
to a predetermined cash position y. A hedger thus uses the possibilities 
offered by futures markets to minimize his risk, then he speculates. 
This interpretation should not, however, conceal that futures and 
cash positions are jointly determined and that the risk-minimizing 
futures position is generally suboptimal since it ignores the oppor- 
tunities for speculative return and because it is decided conditionally 
on a cash position that will generally be suboptimal. It should be clear 
from the previous analysis that the larger | Toon | , the more effec- 
tive in reducing risk is a single cross hedge. This criterion could be 
used for selecting which delivery month, which exchange, or even 
which underlying commodity would offer the best risk-reducing op- 
portunity. 


IV. The Optimal Cash Position 


Equation (5) characterizes the optimal futures position, given y; in- 
serting (5) into (3) gives the necessary condition for y given that f is 
adjusted optimally as y varies. After simplification, this condition is 


pi ne Dora fo a p') — YX (Goo 7 Deitata4o) ad c'(y) == aA) (7) 


Equation (7) is nothing else than an equalization of marginal reve- 
nue and marginal cost. The latter is the sum of the two last terms: c'(y) 
is the marginal cost of production; XY(Foo — Yor211210) is the marginal 
risk premium. The risk premium can be rewritten as XYFoo(1 — R?), 
where R? is the theoretical multiple correlation coefficient squared. 
Thus the risk premium depends on the degree of risk aversion, the 
size of the cash position, the variability of cash prices, and the propor- 
tion of that variability which cannot be eliminated through the opti- 
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mal hedge in the n futures contracts. For a given cash price variance, 
the risk premium decreases as the multiple correlation coefficient 
increases; consequently, the higher the R?, the larger the cash position 
(in absolute value). 

The two first terms of (7) can be viewed as a “planning price,” used 
as a substitute for the as yet unknown cash price. The planning price 
is a weighted sum of the expected cash price and all current and 
next-period futures prices. The weight given each futures price de- 
pends on its coefficient in the multiple regression and thus corre- 
sponds to the proportion of the cash position hedged on that particu- 
lar market. Again it is enlightening to look at the simple cross-hedge 
case (n = 1); the planning price can then be written 


(Bhi Bip) — 8), (8) 
where B = 0 o;/o,, and B = p', — px is the expected value of what is 
known as the basis, the difference between futures and cash price. 
The appropriate allocation signal for the decision maker is now a 
linear combination of the expected cash price and the currently 
quoted futures prices “scaled” for the expected difference between 
futures and cash prices. The interest of this formulation is in the fact 
that people in the trade often assert that bases (i.e., price differences) 
are more easily predictable than price levels. Equation (8) shows that 
the closer the relationship between cash and futures, the more the 
hedger can rely on futures price and basis expectation rather than 
price expectation for “pricing” his output. 

From this analysis, it is quite clear that an increase in the absolute 
value of the covariance between futures and cash prices, | 0»; |, may 
have conflicting effects on real production. On the one hand, it 
decreases the riskiness associated with the “real” activity of the pro- 
ducer; hence the proportion of marginal revenue allocated to risk 
remuneration decreases and the cash position can be made larger in 
absolute value. On the other hand, the heavier reliance on the futures 
price in making production decisions may entail a smaller cash posi- 
tion depending on its level relative to the expected cash price. 

In the general case (n > 1), we can also interpret the planning price 
in terms of the vector of bases, of the form pi, — p;. For the first two 
terms of (7) can be written as 


(1 — Bu)p, + B(P4 — By), (9) 


where B = %o12i7, e is the unit vector, and B= Pi — py (the vector 
of the expected bases).* Thus observed futures prices and basis ex- 








4 We could also conceive of the following alternative generalization of basis for the 
case of n > 1. Let the generalized basis Bg be Bg = 2o:211p' — fi; this is the difference of 
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pectations may substitute for cash price expectations in the output 
decisions. In the special case where the multiple regression coeffi- 
cients sum to unity (Bt = 1), cash price expectations disappear com- 
pletely from the decision. 


V. Adding Futures Markets 


In this section we examine how an agent’s decisions can be altered by 
extending the range of available futures markets. Since we posit no 
fixed costs in transacting in a market and no costs to formulate a 
probability distribution for the futures prices, it is clear that extending 
the list of futures markets will not decrease expected utility.5 Fur- 
thermore, the introduction of new futures will induce a restructuring 
of the entire portfolio so that the effect on the speculative positions is 
difficult to predict. Consequently, we concentrate our attention on 
how the optimal cash position may be affected. 

Specifically, we consider whether the cash position will increase or 
decrease in size when the list of futures is extended. The basic result is 
that when we move from m — 1 futures to m futures, the absolute 
value of the cash position increases if the full vector of futures prices 
is not expected to change from period 0 to period 1. This result allows 
us to argue by induction that increasing the list of futures from m to 
m* (m* > m) will increase real production when all these futures 
present no incentive for speculation. The reason for excluding incen- 
tives for speculation is that, when such incentives exist, the addition of 
futures markets may induce a riskier futures position such that the 
required risk premium in the cash good first-order condition in- 
creases. 

To formalize the result, let us partition the full vector of m futures 
according tof’ = (f1, fm), where f, is an m — 1 vector. Similarly, let pf’ 
= (P14 Pine), and let the variance/covariance matrix of (p15 P't> P'm1) be 


the cash price and the price of the hedger’s “mutual fund” (see eq. [5]). Then the 
planning price becomes Yo.ritPo — Bg. This is the current price of the hedger’s mutual 
fund minus the expected generalized basis. This interpretation is theoretically attrac- 
tive. However, defined in this way, the expected basis cannot be interpreted with 
expectations on price differences only. Hence it does not retain the practical attractive- 
ness of the usual motion of the basis which is due to observation that price differences 
are more stable than price levels. 

° It should be emphasized that the analysis of this section takes the current prices and 
the distributions of future prices as given. General equilibrium effects of adding 
futures markets cannot be traced out in this context. In a general equilibrium analysis, 
Hart (1975) shows that adding more futures markets does not necessarily increase 
welfare. 
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S00 Do1 Tom 
x= 210 a mals 
Omo ae Omm 


Where G9, Tom, ANd Gym are scalars; ¥,, is (m — 1) X (m — 1); and So, 
and X,; are 1 X (m — 1). We assume > is nonsingular. Then net 
revenue is 


T = py - c(y) = (pi: me 40) fi oe (Poa = Pola (10) 


If we consider maximizing a linear mean variance function of 7 
with respect to y,f,, and f,,, the necessary conditions are, respectively: 


pr a c'(y) ry X(9F%o0 Fi Loft oe Conan) = 0, (11) 

Pio ate ae X(-yX10 ts af ae em = 0; (12) 
and f 

a6 re a a X(-9 Com =F ia ae Cinna) = 0. (13) 


Let y**, f#*, and f%* be optimal with a full vector of futures; then 
they solve (11)—(13). Let y* and f# be optimal with a restricted list of 
futures; then they solve (11) and (12) withf,, = 0. We will show that if 
pi: = pio and ph, = pho, then |y**| = | y* | with strict inequality when 
3 Ot loess non-null. 

Under these assumptions, (12) implies f# = y*2;]210. Inserting this 
into (11), we see y* satisfies 


Pi eC NX (Ogg Yor2it210) = O. (14) 


Similarly, using (12) and (13) to solve for f#* and /%* and inserting 
these into (11), we can see that y** must satisfy 


Pi = 6 (yr) — y*x| Goo eit ip) a 
1 


ZOmm 


where z = [1 — (1/Omm)2m121121m]. Now it can be shown that z > 0.8 


(ope Emi Zit Zi0"| = 0, (15) 





6 
ee (1 i. = eat 


m 


1 
= det ( ae 


11 > m 
= det Z ; \ jem det 241. 


m1 Omm 





Zinn )/det ae 


Here we have repeatedly made use of the properties of determinants of partitioned 
matrices (see Dhrymes 1970, pp. 570-71). In the last expression Gmm, 241, and the term 
in parentheses are all minors of 2, a nonsingular variance/covariance matrix by as- 
sumption. Thus the last expression is strictly positive. 
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Then comparing (14) and (15), we see that if oo or 249 is non-null, 


a (Tom aac > PEED 0? | 
ZO 


Pie Ce yx] 700 — Yor2ii20 — 
mm 
(16) 
is positive if y* > 0 and negative if y* < 0. Given the properties of c(-), 
we see |y**| > |y*|. If both oom and 2%, are null, expression (16) 
reduces to (14), and y** = y*. 
As an interesting illustration of the above, consider passing from 
zero to one futures market. In this case y** satisfies 


(1 — B)pi + Blpo — B) —c'(y**) — xy**oo0(1 — p?) = 0 (17) 
(here we allow the possibility that p{ # p%). Similarly, y* satisfies 
pi CANE) RX Coosa 0 (18) 


Using the properties of c(-), it is indeed easy to see that, provided that 
(p> — B) does not differ too widely from p and that futures and cash 
prices are correlated (p # 0), | y= | 2 | y* |. The introduction of 
futures trading leads to increased real production. However, if fu- 
tures and cash prices are sufficiently out of line, the reliance on p4, for 
production decisions may suggest more conservative cash positions. 
On the other hand, if p = 0, the futures market is of no help for 
hedging purposes, and (18) reduces to (17). The original risk of the 
hedger’s real activity is intact (oo); futures trading is limited to plac- 
ing speculative side bets. 

In this latter case, it is clear that the utility derived from futures 
trading is entirely associated with the speculative opportunities of- 
fered. If, however, we abstract from that aspect of futures trading by 
assuming p{ = p%4, it will be clear that utility increments also result 
from the hedging opportunities offered by futures markets; the size 
of the increment depends on the degree of correlation. To see this, 
substitute (5) into (2) with © ,,, So, p', and f being scalars and differ- 
entiate with respect to y and gp, to get 





d(EU) = lb: ome (pf, pe 90 TF Jo 


O11 


+ pty Ba + = BY Nay, 


O11 O71 
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Under our assumptions, this reduces to d(EU)/doo, = X 971/041). 
Expected utility is increasing in | or ie 

Finally, we derive a monetary measure of the benefit of being able 
to trade futures. For simplicity we derive this for a single futures case; 
similar expressions are possible for the multiple futures case. Let y* 
and y** be interpreted as above and suppose that the agent is able to 
trade in futures only by paying a fixed amount k. Let k* be the amount 
that would just leave the agent indifferent to trading futures or not 
trading futures; k* can be interpreted as the maximum fixed charge 
that the agent will be willing to pay for the right to trade futures. 
Within our framework we find 

(po — pi)? 


XF11 


Reply hey i ale(y* sie (y) ee 





- 2 
+ (po — pi)y** a oe A y##3(o0 = ae ) a yoo} 

As an example of how this expression could be used, we might 
imagine a broker who wishes to assess the service charge a potential 
hedger would tolerate in order to deal in futures. If it is thought that 
the hedger is unlikely to perceive the speculative opportunities (p} = 
p%), the amount the hedger is willing to pay is composed of two parts: 
pily** — y*) — [e(y**) — c(y*)] is the expected increment of net 
revenue on the cash market attributable to dealing in futures, and 
{y**[ G99 — (061/011)] — y**o oo} is the increment of risk in the futures 
regime versus the nonfutures regime. This will tend to be positive if 
| @o,| and c” are relatively small and negative when |oouieande are 
relatively large. To sum up, the amount the hedger is willing to pay is 
his increased expected cash revenue less his increase in risk weighted 
by his degree of absolute risk aversion. 


VI. Restrictions on Stochastic Relations between Cash and 
Futures Prices 


Singular Price V ariance/Covariance Matrix 


When » is singular, knowledge of all futures prices is equivalent, in a 
certain sense, to knowledge of the cash price. This has interesting 
implications for both the optimal cash and futures decisions. 

First, recall that from the properties of partitioned matrices, if 2, is 
nonsingular, 


det 2 = (Go — Lormiita10) det 241 
= (1 ae R7) a0 det sae 
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If % is singular, det } = 0 and we see Go = 2Xo1271210 OT, equivalently, 
R?’ = 1. Thus when the variance/covariance matrix of prices is singu- 
lar, the cash price is perfectly correlated with a linear combination of 
futures prices. 

This restriction is closely tied to the notion of the routine hedge. 
For in this case equation (5) implies 3, f = (1/xX)2u2i(po — pi) + 
Ooo. The covariance weighted sum of futures positions is decomposed 
into a speculative component that is proportional to the regression 
weighted sum of expected futures price changes and a pure hedging 
component that is proportional to the cash position. The restriction 
implied by the singularity of 2 stands out more clearly if we dimen- 
sion (as we are free to do) the cash and futures goods so that oo) = 04, 
=... = On. Then when there is no incentive to speculate we have 
Lh pifi = y, where p; is the semple correlation coefficient of ph, and py. 
That is, the correlation coefficient weighted sum of futures must 
equal the cash position. In this sense, a singular & is associated with a 
“generalized routine hedge.” If there is only one futures contract, 
n = I, the pure hedge is in fact a “proper” routine hedge: f = y. 

The implication of a singular for the cash position is seen im- 
mediately from equation (7) for, making use of (9), it reduces to 


(P= Bt)p, + B(ps — B,) —c'(y) = 0. (19) 


Here the risk premium has disappeared, indicating that the cash 
decision is no longer affected by the level of risk accepted in the 
futures portfolio. 


No Basis Risk 


We have already noted that the interest in the basis is a result of the 
fact that the difference of prices may be more stable than the price 
levels. As a limiting case, the cash and futures prices may be random 
while the basis is not. For example, this would be the case if at the time 
of the futures market expiration the costs of transportation to the 
cash market are nonstochastic. 

If the vector of bases B, is a vector of known constants, oy, =o — 
Goo and thus p; = 1 for all i. Cash price is perfectly correlated with 
each futures price. In this case > is obviously singular, so that equation 
(19) holds. We now have the further restriction that the regression 
coefficients sum to unity. For we have Too = Yor2210 = BX or 1 = 
BX10(1/o) = B(p1... py)’ = Bu. Using this in (19), we see that when 
there is no basis risk, the optimal cash position is given by B(ph — B,) 
— ¢'(y) = 0. The planning price is entirely a function of current prices 
and basis expectations; the expected cash price does not enter the 
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decision. For the single cross-hedge case, this condition, first derived 
in Danthine (1978), reads p§ — B, — c’(y) = 0. 


VII. Basis Speculation 


Working (1962) and others have expressed the view that hedging and 
speculative behavior are distinguished not by the one being risk 
avoiding and the other risk taking but by the kinds of risk accepted in 
the two cases. Speculators take price-level risk, while hedgers take 
price difference (basis) risk. Thus hedgers are said to speculate on the 
basis. 

Restricting ourselves to the n = 1 case, we can give a basis specula- 
tion interpretation to the hedger’s problem by assuming c(y) = poy; 
that is, the hedger is a storage company with costless storage. When 
we rewrite equation (1), net revenue becomes 


im = (p1 — poy — (pi - Pof 
Big — B,)y nF (po— pi)s, 


where s = f — y 1s the excess of the futures position over the routine 
hedge. The agent speculates on the basis by the amount y and on the 
price level by the amount s. Working’s pure basis speculator arbitrar- 
ily sets s = 0. Our agent jointly selects y and s, thus, in general, 
attaining a better risk/return combination. While suboptimal in this 
sense, pure basis speculation may be defended if the agent possesses 
information about B, and var B, but not the moments of the under- 
lying prices. 


VIII. Conclusion 


Our descriptions of optimal hedging decisions are specific enough to 
be implemented and their performance to be evaluated. However, the 
question of empirical applications raises several issues that we have 
not addressed explicitly. 

First, we have ignored the fact that futures contracts are 
standardized as to quantity and must be traded in integer multiples. 
For a small user this discreteness 1s a serious concern and may elimi- 
nate the possibility of multiple cross hedges. Even for the large 
hedger the discreteness of futures markets implicitly may limit the 
number of markets that should be considered in the portfolio. 

A more important concern is the fact that we have assumed that the 
agent has knowledge of the relevant moments of the probability 
distributions of prices. In practice, a considerable effort may be re- 
quired to estimate these moments, particularly if they are subject to 
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change over time. The stability of moments (or of some more general 
structure from which the moments can be derived) is, of course, 
fundamental to our results, as indeed it is to the notion of hedging. 
The limited ability to identify such stable relationships is likely to be 
the most serious constraint in developing very extensive multiple 
cross hedges. In Anderson and Danthine (1980) we address some of 
the problems of estimation and illustrate the approach with empirical 
applications. 

While we feel that we have provided an account of hedging in 
futures markets that is more satisfactory than any other we know, we 
recognize it is not fully general. One restriction is that we have 
assumed a nonstochastic production process. Judging from Anderson 
and Danthine (1978) and Rolfo (1980), which treat production un- 
certainty, the results of the present paper would not be seriously 
altered by the relaxation of this assumption. Finally, in assuming only 
two trading dates, we have suppressed the possibility of changing the 
futures position many times during the period of holding an open 
cash position. In Anderson and Danthine (1981) we consider the 
problem of hedging in a single futures market but with many possible 
trading dates. 
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The Implications of Competition among 
Jurisdictions: Does Tiebout Need Politics? 
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The paper investigates whether competition among local jurisdic- 
tions is, by itself, sufficient to ensure efficient provision of local 
public goods. Jurisdictions have fixed boundaries, and each has an 
entrenched government with the power to tax and supply the public 
good. Residents can move costlessly among jurisdictions. It is shown 
that competition among numerous jurisdictions is not sufficient to 
guarantee public sector efficiency. Though residents can “vote with 
their feet,” land is immobile. Hence, governments can usurp some 
land rents for their own ends. Increasing the number of jurisdictions 
limits but cannot completely eliminate the ability to exercise dis- 
cretionary governmental power. 


I. Introduction 


Governments are alleged to be too big, inefficiently operated, and 
unresponsive to the public. The assumption that government activi- 
ties are undertaken to serve the public has fallen into disrepute.’ In- 
stead, positive analyses of public activities increasingly take the gov- 
ernmental objective to be that of satisfying a bureaucratic goal such as 
expenditure maximization or maximization of the utility of the head 
of the government agency.” Such objectives have been assumed to 
characterize local as well as national governments.’ Indeed, the idea 
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1 See, e.g., Downs (1957) and Sugler (1971) for seminal arguments. 

2 See Posner 1974; Tullock 1974; and Niskanen 1975. 
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that local governments may pursue various goals is implicit in analyses 
of local fiscal variables which assume that the values of these variables 
are dependent on or influenced by the process of collective choice; the 
characteristics of the decisive voter(s) as well as the nature and institu- 
tional form of local governments are presumed to matter.’ 

At the national level, it is often argued that competition via the 
electoral process leaves the bureaucrat considerable latitude because 
voters have limited incentive to learn about candidates or to vote their 
preferences. At the local level, however, it has been argued that 
competition among jurisdictions restricts a bureaucrat’s ability to pur- 
sue policies which do not reflect the residents’ desires. In fact, Tiebout 
(1956) argued that jurisdictional competition would guarantee the 
same efficiency in the provision of local public goods that competition 
among firms assures in private goods markets. Tiebout’s argument 
thus appears to imply that the ballot box is unnecessary, that efficient 
provision of local public services will arise because residents can vote 
with their feet. If, in fact, jurisdictional competition leads to efficient 
local goods provision, then analyses of the political choice process 
would appear to be applicable primarily in those cases in which the 
relevant set of jurisdictions is small. 

In a recent article, Courant, Gramlich, and Rubinfeld (1979) study 
the choice of fiscal variables by a local government that has been 
captured by its bureaucracy. Their analysis, which focuses upon deci- 
sion making in a single jurisdiction, investigates the extent to which 
the ability of residents to leave the Jurisdiction limits the power of its 
bureaucracy. Their paper provides an interesting analysis of the 
trade-off faced by a public employees’ union in a single jurisdiction 
whose residents are mobile. 

Our analysis focuses on the equilibrium among jurisdictions. We 
seek to determine how changing the number of Jurisdictions (i.e., 
changing the degree of competition among jurisdictions) constrains 
tax and expenditure policies of individual governments. We posit 
alternative assumptions about governmental objectives to examine the 
consequences of jurisdictional competition.® 

In particular, we question whether competition among a large 
number of jurisdictions forces governments to implement policies 
that are in the best interests of their residents. To answer this ques- 
tion, we deliberately consider a rather extreme version of government 


* See Barr and Davis 1966; Borcherding and Deacon 1972; Bergstrom and Goodman 
1973; Denzau and Mackay 1976; Inman 1978; and Romer and Rosenthal 1978. 

5 Courant and Rubinfeld (1978) explore benefit measurement issues in a model with 
equilibrium among jurisdictions. However, neither Jurisdictional competition nor an 
Investigation of implications of alternative governmental objectives is the focus of their 
paper. 
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entrenchment. The government (the mayor, the bureaucracy, or 
however defined) in each jurisdiction is assumed to choose a tax rate 
and level of government spending to maximize government profit. 
The government is limited in its choice of tax instruments by a 
constitution over which it has no control, but there is no political 
mechanism by which residents can affect the tax rate or the level of 
government spending. Residents can, however, move freely among 
jurisdictions. We do not claim that local governments actually maximize 
profits; we do not wish to argue that local governments are entrenched and 
unresponsive to their electorates. Resident voters, via intrajurisdictional pro- 
cesses, may indeed restrain governmental behavior to be in accord with their 
own desires. Without denying this possibility, this paper addresses the question 
of whether interjurisdictional competition can, in itself, accomplish the same 
end. Although increasing the number of jurisdictions reduces each 
government’ ability to levy taxes in excess of expenditures, we dem- 
onstrate that competition among jurisdictions, even with very many 
jurisdictions, is not sufficient to prevent individual governments from 
pursuing policies which are not in the interests of their residents. We 
thus demonstrate that the right of citizens to choose among many 
communities cannot completely eliminate governmental monopoly 
power; that is, competition among governments is not equivalent to 
competition among firms. A further implication is that the political 
choice process can matter. 

The above results are established in a model in which every gov- 
ernment seeks to maximize governmental profit and in which each 
jurisdiction is equivalent in size and location. Once this result has 
been established, it is natural to ask whether governments in different 
jurisdictions can coexist while pursuing different objectives. This 
question is answered affirmatively even when the number of jurisdic- 
tions is very large. In answering this question, we shed additional light 
on the “capitalization hypothesis” and further clarify the interpreta- 
tion of tests of the Tiebout hypothesis. Finally, we explain what would 
happen if all jurisdictions were not homogeneous in size or locational 
desirability or both. One of the implications is that the presence of a 
large number of jurisdictions in one portion of the metropolitan area, 
say, the suburbs, limits the monopoly power of jurisdictions in that 
portion of the metropolitan area but has little effect on the monopoly 
power of jurisdictions in other portions of the metropolitan area, say, 
the central city. 

In the section which follows, the model is introduced and some 
comparative statics are derived for use in the subsequent sections. In 
Section III, governments are assumed to maximize profits, and the 
implications of jurisdictional competition are examined. The conse- 
quences of jurisdictional competition when objectives differ among 
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governments are examined in Section IV. In Section V we investigate 
the case in which jurisdictions of differing size occupy differing por- 
tions of the metropolitan area. A summary and conclusions are pre- 
sented in Section VI. 


II. Equilibrium with Multiple Jurisdictions® 


In a metropolitan area with homogeneous individuals each of whom 
can freely choose the jurisdiction in which he wishes to reside, no 
Jurisdiction can unilaterally create a level of well-being for its resi- 
dents different from that available to residents of the remaining 
jurisdictions (assuming more than one). If, for instance, one jurisdic- 
tion chooses a fiscal package (i.e., tax rate and level of government 
services) which residents find relatively unattractive, they will respond 
by moving to other jurisdictions. As a result of this movement, hous- 
ing prices adjust to compensate for differences in the desirability of 
fiscal packages. In this section, we will set forth more exactly the 
implications of this assumption that individuals can choose their resi- 
dential location. The primary purpose of this section is to derive 
properties of the equilibrium. These properties are used in the 
analyses in subsequent sections. The analyses in this and the following 
section are kept relatively simple by the assumption that all jurisdic- 
tions are alike both in terms of their natural physical amenities and in 
terms of their fiscal variables. 

Consider, therefore, a metropolitan area with a fixed total land 
area, L, in which there are J jurisdictions.” The land area is divided 
equally among the jurisdictions, so that each has a fixed boundary 
encompassing land area L = L/J. Housing is produced by competitive 
firms in the jurisdiction from land and nonland factors via a 
constant-returns neoclassical production function. The price of non- 
land factors is assumed fixed and uniform throughout the met- 


ropolitan area. The housing-supply function in the jth jurisdiction 
can then be written as:8 


A’, = Lhi,(P},), (1) 


° In this and subsequent sections of the paper, we assume an equilibrium exists and 
we investigate the properties of the equilibrium. 

"In Sec. V we show that our results are not changed qualitatively if it is assumed that 
additional land is available at the urban fringe. 

* The housing-supply function is obtained by assuming that housing producers 
choose the ratio of nonland to land inputs to maximize profits. The production 
function is assumed to exhibit constant returns to scale. The ratio of nonland to land 
inputs will be an increasing function of the price of housing, and it follows that the 


amount of housing supplied per unit of land will increase as the price of housing 
increases. 
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where Pj, is the price of housing in the jth jurisdiction and h, is the 
supply function of housing per unit of land. 

The jurisdiction supplies G’ units of a local public good; each 
resident is assumed to receive an equal nonshared amount. The good 
is produced with constant returns to scale from nonland factors, and 
its unit cost will be taken as unity. Thus, the total cost of the good is 
proportional to the number of residents. The jurisdiction finances 
this good by assessment of a flat rate property tax, #, on the market 
value of housing in the jurisdiction. Determination of the levels of G/ 
and # will be discussed in the following sections. 

There are N identical individuals in the metropolitan area and N’ in 
the jth jurisdiction. Each individual has a utility function over gov- 
ernment services, housing, and numeraire goods (B), U(G, h, B), and 
faces the budget constraint, J = Py,h(1 + t) + B. Individuals in each 
jurisdiction choose housing quantity to maximize utility, taking G’ and 
# as given. Let V[G’, P#,(1 + #)] denote the indirect utility function that 
results. That is, 

V(Gi,P},1 + #)) = a O(GEhi It -Pingl 8) |; (2) 
J 
a 
The demand function for housing by an individual in jurisdiction 7, 


given the decision to live in jurisdiction 7, that results from this 
maximization is: 

hi, = hilG’,P}(1 + #)I. (3) 
Note that hg expresses housing demanded per person, unlike h,, 
which expresses housing supplied per unit of land. 

Each individual is free to move among jurisdictions, and his loca- 
tional choice is that which maximizes his utility. In equilibrium, no 
individual has an incentive to move. Thus, utility must be the same in 
all jurisdictions: 


V(G',Pi(1 + #)) =V[Gi,Pi + #)). (4) 
All Ni individuals in each jurisdiction must be housed, so 
Nihi, = Lhi, (5) 


and all individuals in the metropolitan area must live in some juris- 
diction, 


ee (6) 
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We will assume that the government of each jurisdiction, whatever 
its objective, adopts a Cournot-Nash strategy, choosing its G’ and #/ 
treating all other jurisdictions’ choices of G and ¢ as parameters (i.e., 
no attempt will be made to investigate strategies in which one gov- 
ernment attempts to determine how changes in its tax rate or service 
level will affect the choice of tax rates or service levels in other 
jurisdictions). In the remainder of this section, we derive the com- 
parative static response of Pj, and N’ in each jurisdiction when t* and 
G* are varied in any one jurisdiction. For notational convenience, we 
will denote jurisdiction 1 (i.e.,k = 1) as the jurisdiction whose tax and 
spending levels are perturbed.° 

If jurisdiction 1 differentially changes its tax rate while no other 
fiscal parameter is altered, individuals would move into or out of 
jurisdiction 1 and out of or into the remaining J — 1 jurisdictions. 
Changes in housing prices will ensure that equilibrium, as described 
by conditions (4) and (7), is restored. Consequently, totally differ- 
entiating (4) and (7), we obtain the response of prices to a change in 
it; 


—Uph4a(1 + t')dP} + UShi(1 + #)dPi, = BhgPdt',7 #1.° (8a) 


J 8 hs 
Ohi hi) yp; 0(hs/hg) 
Lop, ea a. (8b) 
= 


In order to simplify notation, for each jurisdiction, let 7 represent 
the elasticity of housing demand with respect to the gross-of-tax price 


° Technically, this model is not “closed” for two reasons. First, land has a positive 
price in all jurisdictions and the return on land wealth has not been included as income 
to consumers. Second, if the government obtains tax revenue in excess of expenditures, 
the profit should be counted as income to some agent. Formally incorporating these 
elements requires dealing with wealth effects—which are a nuisance and do not shed 
any light on the issues of interest to us. The return on land wealth could be incorpo- 
rated in one of two ways. Title to land could be created with each consumer-resident 
endowed with shares valued at 1/Nth of the aggregate value of the land in the 
metropolitan area. This would not change our analysis or our results in any essential 
respect. Alternatively, the utility function of consumers could be assumed to display a 
constant marginal utility of the numeraire commodity, i.e., to be of the form U = 
W(G,h) + B. With this utility function, variations in wealth arising either from changes 
in land value or from profits in the government sector would not affect the equilibrium 
in the market for housing within or across jurisdictions. The fact that wealth effects can 
be introduced in these ways without affecting our qualitative results reinforces our 
assertion that they can be assumed away. While closing the model by one or more of the 
devices mentioned above may be an appealing formalism, the added complexity adds 
cumbersome detail without in any way illuminating the issues being addressed. 

© In obtaining (8a) from (4), we have made use of eq. (2). Note that (4) and (2) 
together imply that the direct utility function evaluated at the optimum choice of 
housing must be equal across jurisdictions. Hence we simply use the envelope theorem, 
that residents’ housing demand optimally adjusts to the new gross-of-tax price of 
housing, P;(1 + t), in each community. 
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of housing (n < 0), let & represent the elasticity of housing supply 
with respect to the net-of-tax price of housing (6 > 0), and let y 
represent the elasticity of housing demand with respect to govern- 
ment services. In addition, let 8 represent the functionally deter- 
mined density of population. 

Since in this section all governments are assumed to behave identi- 
cally and adopt the same strategy and all governments and residents 
are identical, we assume that all governments will adopt the same G 
and ¢ in equilibrium. Thus, when ? is perturbed, dP}, = ... = dPi,, and 
the levels of all endogenous variables are the same in all jurisdictions. 
Using these conditions and the elasticities and ratio defined above, we 
may rewrite equations (8) as: 


Cis Me J INO) 0 


dt! hii 1)(0 =") 


aPy P16 ad 
db aceon 0, ite=eeo as chare? 2/2 (9b) 
It is apparent from (9a) that an increase in the number of jurisdic- 
tions increases the sensitivity of Pj, to changes in ¢'; that is, as vi 
increases, dP},/dt! becomes more negative, and in the limit as J > © it 
approaches —P,,/(1 + t). This implies that in the limit the derivative of 
the gross-of-tax price, d[P}#(1 + t')]/dt', approaches zero, so that any 
change in ? is fully offset by a change in P}. As the number of 
jurisdictions becomes large, the price in other jurisdictions is unaf- 
fected by changes in #', as is seen by taking the limit of (9b) as J > ~. 
Under our simplifying assumption of identical governments and 
residents, the comparative static effects on housing prices caused by a 
change in G! can be derived in the same manner as the effects caused 
by changes in ¢’. These comparative statics are: 


BOB gate week eV a se 
dGt = JUgha1 +t) JG(@-—%) ’ 

ar ee A eS oa dl Vs We (10b) 
dG! jJUphal + 1) JG(@—n) 

The first term in (10a) reflects the increase in housing prices arising 
because jurisdiction 1 becomes more desirable to consumers as G1 
rises, so that more consumers wish to enter. The second term indi- 
cates the effect of the complementarity (y > 0) or substitutability (y < 
0) of government services and housing. If they are complementary, 
an increase in G1 causes an increase in housing demand _ per 





(Ya) 





(10a) 


11 Detailed enumeration of the steps in deriving these results and results in sub- 
sequent sections is available on request. 
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consumer—which in turn causes P}, to be bidden up.” The terms in 
(10b) are interpreted similarly. 

As the number of jurisdictions increases, equation (10b) indicates 
that the effects of changes in G! on prices in other jurisdictions go to 
zero. As J > , equation (10a) shows that any rise in G! will be offset 
by a rise in Pj, to keep the consumer-resident’s utility constant. 

The effects of changes in ¢! and G! on the population in jurisdiction 
1, N’, are derived by differentiation of (5) and by use of the results in 
(9) and (10). Performing the necessary differentiation and substitu- 
tions yields: 





UNO) ee No 
- eee (11a) 
NOM SS |e ae lige) Cre Oise n) (11b) 


dG} GY [Caan OeP ara lickh) 


These results are easily summarized. An increase in the tax rate 
causes a decline in the jurisdiction’s population because the fall in the 
equilibrium net-of-tax price (see [9a]) causes a reduction in housing 
supply. By contrast, a rise in government services makes the jurisdic- 
tion more attractive. As a consequence, the population of the jurisdic- 
tion will rise, except in the case where complementarity of housing 
and government services causes an increase in per capita housing 
demand large enough to offset the decline in per capita demand that 
results from the price increase exhibited in (10a). 

It should be emphasized that the comparative static results derived 
above do not depend on any particular objective function for the 
Jurisdictions. They simply characterize an equilibrium in which iden- 
tical jurisdictions pursue identical Cournot-Nash strategies. They can 
thus be used to investigate governmental objectives such as maximi- 
zation of property values, the utility of the median voter, government 
expenditure, or “government profit.” 


III. Profit-maximizing Government 


In this section we assume that each jurisdiction is controlled by a 
monopolist whose objective is to maximize governmental profit, 
defined here as the excess of tax revenue over government expendi- 
ture. We will neither defend nor deny the realism of this govern- 
mental objective; this choice of extreme objective is simply a conve- 
nient vehicle for determining whether jurisdictional competition can 


In their study of incidence, Polinsky and Rubinfeld (1978, p. 257) find that this 
complementarity or substitutability is one factor determining how a change in govern- 
ment services affects land prices and other variables. 
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eliminate the power of local governments to adopt policies which are 
not in the best interests of their residents. In practice, voting by 
residents might prevent such exercise of monopoly power. However, 
our purpose is to determine whether competition among jurisdictions 
is by itself sufficient to prevent the exercise of governmental power. 
Hence, we deliberately assume that the monopolist can set G and t 
without electoral approval. 
Profit to the government of the representative jurisdiction is: 


7 —tPalt. HIP ALC) N(LG)C. (12) 


Superscripts have been dropped since all jurisdictions are identical. 

Differentiating (12) and using (1) and the comparative static prop- 
erties of the functions P,(t,G) and N(t,G), we can write the first-order 
conditions for profit maximization as: 





On _,| Pubs [, _ 1a(1 + 6) , G8 — 16) _ 
a ref Pe eae | arf, | 0. (13a) 


Jil) 
UdJ— sy 8 


ah eer, lean +0) ace 
(13b) 


| Pasha + OiaaClO = | hls | = 0. 


Canceling L, taking limits as J] > », and rearranging the above, we 
obtain: 


Pu eae oo (14a) 





EP eee oT Say 6/117 (14b) 
Us Pra le?) 

From (14b) it is apparent that the marginal rate of substitution of 
government services for numeraire goods departs from the ratio of 
relative prices (which equals one). However, because of the distortion 
that arises from the property tax on housing, even a welfare- 
maximizing government would choose a level of public good supply 
such that the marginal rate of substitution does not equal the price 
ratio. 

From (14a), it is clear, unlike the Cournot result when the number 
of firms is increased, that government monopoly profits are positive 
even when the number of jurisdictions is large. Tax revenue per 
capita, (Pha, exceeds government spending per capita, G, by a posi- 


13 We assume that the government does not consume housing. 
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tive amount, related to the value of housing per capita, Pyhg, and to 
the housing-supply elasticity, 6. Thus competition among govern- 
ments encompassed by fixed boundaries is not sufficient to eliminate 
those governments’ monopoly power. 

For an intuitive understanding of why jurisdictional competition 
does not eliminate governmental power, consider the following sim- 
plified version of the model. Suppose consumers get no utility from 
government services. Obviously, then, the bureaucratic monopolist 
will setG = 0. The condition determining the profit-maximizing value 
of t is then (13a) with G = 0. Suppose further that @ and 7 are constant 
over the relevant range. Then (13a) can be solved for ¢ to yield: 


s JOH) 15 
Cen es 


Taking the limit of (15) as n > —%, it is seen that: 


lina 


17-2 


(16) 


a|- 


Thus, when housing demand is perfectly elastic, the profit- 
maximizing tax rate is determined solely by the elasticity of housing 
supply independent of the number of jurisdictions. With perfectly 
elastic housing demand, the price of housing is exogenous, but bu- 
reaucratic monopolists can still profit by sharing in the rents of the 
landowners whose land is confined to some given jurisdiction. Alter- 
natively, suppose that housing is in perfectly elastic supply. Then, 
from (15): 

Se ye 

lim C= nite hk (17) 

0 iff ake 

With only one jurisdiction, the bureaucratic monopolist can exploit 
the housing-demand elasticity. If there is more than one jurisdiction, 
housing can be expanded indefinitely at constant cost in each juris- 
diction, and the individual jurisdictions are unable to exploit the 
elasticity of the demand curve. 


Next, consider the effects of an increase in the number of jurisdic- 
tions in (15): 


1 
lim ¢ = -. 
Ea ae sna 
Comparing (16) and (18), we see that the effect of having very many 
jurisdictions is similar to the effect of having an infinitely elastic 
housing demand. Competition among jurisdictions prevents a bu- 


reaucratic monopolist from exploiting the demand elasticity but not 
the supply elasticity. 


IMPLICATIONS OF COMPETITION 1207 
Finally, differentiating ¢ with respect to J in (15) reveals that 


dt _ @ =) ,8) 


df Of(@-)- + OF 


An increase in the number of jurisdictions does monotonically de- 
crease the tax rate. 

While these results are surprising, they are easily explained. The 
government of each jurisdiction is given the exclusive right to tax the 
property in that jurisdiction. Since boundaries are fixed, a landowner 
cannot move his land to another jurisdiction by a redefinition of 
boundaries. Fixity of boundaries prevents the tax on land from being 
competed away. Jurisdictions compete for residents, and competition 
among a large number of jurisdictions prevents exploitation of the 
elasticity of housing demand. Jurisdictions in our model here do not 
compete for land; consequently, the existence of a large number of 
Jurisdictions does not prevent exploitation of the elasticity of housing 
supply. 

Although the tax exploits only the elasticity of supply when there 
are many jurisdictions, one should not conclude that the tax merely 
redistributes wealth from landowners to bureaucrats. The tax reduces 
the unit price of housing received by suppliers, thereby reducing the 
supply of housing relative to the no-tax equilibrium. As a result, the 
gross-of-tax price of housing must be higher and per capita housing 
consumption lower than in the no-tax equilibrium. The only excep- 
tion would be if housing supply were perfectly inelastic. For this case, 
the tax would not affect housing production or consumption. 

Our results can be illustrated geometrically if we consider a sim- 
plified version of the model in which G is fixed at zero for all jurisdic- 
tions. Each curve in the right-hand portion of figure 1 shows combi- 
nations of net-of-tax housing prices and tax rates that generate the 
same gross-of-tax price and hence the same level of utility. The 
further to the northeast, the lower is the utility level. Let V° represent 
the highest utility attainable, that level which would occur if ¢ = t® = 0 
in all jurisdictions and competitive market forces determined a net- 
of-tax price of P%. The housing market is depicted in the left-hand 
diagram. Recall that the housing-supply function per unit of land, h,, 
is independent of the number of jurisdictions and of the tax rate. The 
housing-demand function per unit of land is a product of the housing 
demand per person, Ad P (1 + t),G], and the population density, 6. 
Hence each jurisdiction exhibits the same housing price, P%, and the 
same population density, 5°. The zero tax equilibrium in the housing 
market is shown as point Z. 

Now consider the situation in which all governments set the same 
positive tax rate, t'. We show that this lowers utility and results in a 


<0. (19) 
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net-of-tax price lower than P§. If the net-of-tax price were to decline 
sufficiently to reestablish the original utility level, at point W, the 
housing-demand function would be unaltered at 5% gP%(1 + t°)]. But 
with the resulting lower net-of-tax price and an unaltered demand, 
there would be a housing shortage. With a tax rate of ¢1, the housing 
market could clear only if the net-of-tax price rose above the level at 
point W. This requires an increase in the gross-of-tax price, hence a 
decrease in utility below the V° level. If P did not fall, the housing- 
demand curve (not shown) would shift downward since P%(1 + t!) > 
P%(1 + t°) and 6 is unchanged at 6°. This would create a housing 
surplus. Thus, the housing market will clear only if Py falls below PY, 
say, to Pj. Consequently, a feasible equilibrium with tax rate ¢? is 
shown as point X in the housing market and as point Y in the utility 
space. Finally, the tax base per unit of land, with ¢ = ¢1, is found as the 
area OP}, Xhi. 

Now, suppose the points X and Y represent the equilibrium when 
there are many jurisdictions. We illustrate why it would not be in the 
interest of any of these jurisdictions to unilaterally depart from this 
equilibrium. We then illustrate why it would be in the interest of a 
monopoly jurisdiction to choose a higher tax rate. 

Suppose a competitive jurisdiction, say jurisdiction 1, were to uni- 
laterally choose a tax rate, say, t? > ¢!.14 Given residential mobility, the 
utility level cannot differ across jurisdictions. Thus, the gross-of-tax 
price in jurisdiction 1, P?{1 + t?), would equal P#(1 + ¢), which 
implies Pj < P}. Consequently, fewer housing units would be 
supplied in jurisdiction 1. To restore equilibrium in the housing 


‘4 We assume in the diagrammatic exposition that competition among jurisdictions 
does not force all jurisdictions to offer the same tax-service package. This issue is 
investigated in Sec. IV. 
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market, it would be necessary for some residents to move to other 
jurisdictions, lowering the population density in jurisdiction 1, say, to 
5* < 6°. The new hypothetical equilibrium for jurisdiction 1 is illus- 
trated by points S and T. Since it contains but a small fraction of the 
area’s land and population, migration out of jurisdiction 1 would have 
no effect on the remaining jurisdictions. Thus, the hypothetical 
equilibrium for the remaining jurisdictions remains at points X and Y. 

By increasing its tax rate, jurisdiction 1 decreases its tax base per 
unit of land to OP, Sh;. Should the percentage decline in the tax base 
be greater than the percentage increase in the tax rate (as suggested 
by fig. 1), it would not be in the jurisdiction’s interest to increase its tax 
rate. Similarly, were any one jurisdiction to reduce its tax rate below 
t”, the resulting increase in the tax base would be more than offset by 
the decline in the tax rate. Consequently, points X and Y describe the 
equilibrium for competitive jurisdictions, because there are no incen- 
tives for any of the economic agents, governments, residents, or 
housing suppliers to alter their behavior. Note that, in equilibrium, 
there are tax revenues collected by each jurisdiction, even when Ca 
0; jurisdictional competition is not sufficient to eliminate government 
profit. 

Note that any jurisdiction’s incentive to change its tax rate, and 
hence the equilibrium tax rate, depends on the elasticity of the h, 
function. The lower is this elasticity, the greater is the incentive to 
raise the tax rate. This result can be indicated by considering the case 
in which h, is completely inelastic; in particular, assume that h, rises 
vertically through point X. If any jurisdiction were to increase its tax 
rate, any decline in housing demand would not cause a loss of housing 
units; hence the jurisdiction’s tax revenue would increase as long as 
the percentage increase in the tax rate, At/t, is greater than the 
percentage decline in the net-of-tax price, AP n/P y. Since gross-of-tax 
plice: is constant, (P= sAP a)(lite fit Al) = Pall igo) pandacon- 
sequently AP;,,/P, = At/(1 + t + At) < At/t. A similar incentive to 
increase ¢ would face each jurisdiction, and ¢ would be continuously 
forced higher. By altering the supply curve through point X, we can 
easily show that the incentive to raise ¢ decreases with an increase in 
the supply elasticity and that a sufficient increase in elasticity creates 
an incentive to lower the competitive equilibrium tax rate. 

Next, suppose that the metropolitan area has but one monopoly 
jurisdiction. Also assume that the tax rate was initially set att’ and that 
the housing market and utility configurations were initially described 
by points X and Y. If the monopolist were to choose a higher tax rate, 
such as ¢2, this would increase the tax revenue per unit of land, unlike 
the situation for the single competitive jurisdiction, because residents 
cannot escape the monopolist by moving to other jurisdictions. Popu- 
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lation density throughout the monopolist’s jurisdiction would be un- 
changed at 5°. An increase in the tax rate to ¢? has two effects. First, 
assuming momentarily that the net-of-tax price is unchanged at P}, 
the per capita demand for housing declines, since P},(1 + ¢2) > P31 + 
t'). But with population density fixed at 6°, this implies an excess 
supply of housing. Consequently, the second effect is that the net- 
of-tax price must fall, say to P#. The resulting situation is depicted as 
points M and N. Note, though, that the decline in the monopolist’s tax 
base per unit of land, to OP?,Mh3, is substantially smaller than the 
decline previously indicated for a competitive jurisdiction con- 
templating the same tax rate increase. Thus, the monopoly jurisdic- 
tion would find it profitable to increase its tax rate, even though a 
competitive jurisdiction would not. Assuming tax revenue changes 
monotonically with the number of jurisdictions, it follows that tax 
revenue per unit of land, hence total tax revenue, is inversely related 
to the number of jurisdictions. With G fixed, government profit is also 
inversely related to the number of jurisdictions. 


IV. Differing Objectives among Governments 


As we indicated in the Introduction, data for local jurisdictions have 
been used extensively for empirical studies of models of government. 
The implicit presumption of these studies is that competition among 
Jurisdictions does not prevent different governments from pursuing 
different goals or prevent them from offering different tax and 
service packages. In this section, we demonstrate that this presump- 
tion is correct. In so doing, we provide a theoretical justification for 
viewing local jurisdictions as natural laboratories for studying models 
of government. At the same time, we demonstrate that interde- 
pendence among local jurisdictions should not be ignored when such 
studies are conducted. Finally, results in this section make it clear that 
our result in the preceding section, that competition does not prevent 
governmental exercise of monopoly power, is not an artifact of our 
assumption of homogeneous governmental objectives. 

As in the preceding section, we assume that an equilibrium exists, 
and we characterize the properties of that equilibrium as the number 
of jurisdictions becomes large. It is first necessary to derive the com- 
parative static properties of an equilibrium in which tax and service 
packages differ across jurisdictions. Conditions (1) through (7) still 
obtain. The comparative static responses of prices in all jurisdictions 
to a change in the tax rate in an arbitrarily chosen jurisdiction can be 
derived from equations (8). There are J equations in (8) from which 
the J expressions 0P},/0t! can be derived. The derivation differs from 
that employed in Section II only in that jurisdictions are not assumed 
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to adopt identical tax-service packages in equilibrium. By similarly 
perturbing G in some arbitrarily chosen jurisdiction, say, 1, we can 
derive the J expressions for 0Pj,/9G!. Thus, just as in Section II, 
comparative static responses of price in each jurisdiction to changes in 
its own fiscal variables and to changes in other jurisdictions’ tax rates 
and service levels can be derived. 

By investigating the limiting properties of these comparative statics, 
we find that, when the number of jurisdictions in a particular subarea 
becomes very large, the level of utility of residents of any of these 
Jurisdictions is invariant to its choice of tax rate or level of government 
services. The amount of land in any of these jurisdictions is very small 
relative to both the total land area and to the land area of jurisdictions 
having the same amenity level. Consequently, the number of residents 
migrating into or out of any one of these jurisdictions in response to a 
change in its tax rate or level of government services is too small to 
affect the price of housing elsewhere. Hence, utility does not change 
elsewhere. Rather, the migration induces an adjustment of the hous- 
ing price within the jurisdiction and that adjustment ensures that the 
level of utility within the jurisdiction continues to equal the utility level 
available elsewhere. By contrast, if a jurisdiction has a nonnegligible 
share of the land in the metropolitan area, migration induced by a 
change in its fiscal variables is sufficient to change housing prices and 
utility in all jurisdictions. 

While an individual government cannot affect the equilibrium level 
of utility of its residents when the number of jurisdictions is large, tax 
rates and service levels are not similarly constrained. Whether 
changes in these variables are desirable depends upon the govern- 
ment objectives pursued, but it is clear that local governments have 
latitude in choosing these variables. Put differently, the “market” for 
local public goods does not force the government to act as a “tax rate 
taker.” This is illustrated in figure 1. Suppose that there are a large 
number of jurisdictions each of which occupies a small fraction of the 
land in the metropolitan area. Let V be the level of utility realized in 
equilibrium in all jurisdictions. Consider two of these jurisdictions and 
assume, for simplicity of illustration, that both offer the same level of 
government services, G. The jurisdictions offer differing tax rates, t! 
and ?#?, but residents of both jurisdictions enjoy the same level of 
utility, V (shown by the equilibrium points Y and T), and the same 


45 Each of the many jurisdictions faces the constraints as described by (dPj,/dt') = 
(dPj,/dG*) = 0 and by eqq. (1), (2), and (4). If one, and perhaps only one, of these 
jurisdictions should be a profit-maximizing jurisdiction, as defined in the previous 
section, it is relatively easy to show that such a jurisdiction would choose its fiscal 
variables such that conditions (14) are attained. Hence, choice of ¢ and G by profit 
maximization is not inconsistent with jurisdictional competition. 
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gross-of-tax price as all other residents of the metropolitan area. In 
this equilibrium, jurisdiction 2 has a higher tax rate, a lower net-of-tax 
price, and a smaller population than jurisdiction 1. 

By holding the tax rate constant, jurisdictions offering different 
levels of government services can be illustrated by a diagram similar to 
that in figure 1. The diagram would differ only in that G would 
replace ¢ on the right-hand quadrant, and indifference curves would 
slope upward. Jurisdictions offering high levels of G would attract 
more residents than jurisdictions offering low levels of G. As a result, 
the price of housing would be higher in jurisdictions offering higher 
levels of government services. Competition will require that utility be 
the same in all jurisdictions, but equality of utility across jurisdictions 
does not require equality of tax rates or equality of levels of govern- 
ment services. 

The finding that, in equilibrium, utility of residents of a jurisdiction 
is independent of its government’s fiscal decisions is a startling result, 
one which highlights an important conceptual distinction between 
residents of a jurisdiction and its landowners. For residents who rent 
housing, the choices of tax rate and government spending level are 
matters of complete indifference in equilibrium. Housing prices will 
adjust to maintain their equilibrium utility level. By contrast, the 
wealth of landowners is directly affected by the choice of tax rates and 
level of government service. A resident landowner would clearly 
support a governmental objective of property value maximization. 
Such an objective would maximize the wealth of the resident as 
landowner, and, given his mobility, it would not diminish his utility as 
a consumer of government services and housing. 

The results in this section also have implications for tests of the 
Tiebout hypothesis. In proposing his test of the Tiebout hypothesis, 
Oates (1969) argued that differences in tax rates or in the level of 
public services provided would be capitalized in housing prices. Edel 
and Sclar (1974) argued precisely the opposite—that in a long-run 
Tiebout equilibrium, tax rates and government spending levels would 
be uncorrelated with housing prices. In the model developed here, 
Oates’s interpretation is clearly correct. In equilibrium, differences in 
tax rates and levels of government services will result in differences in 
housing prices across jurisdictions. This result does not depend on the 
existence of a large number of jurisdictions, nor does it depend on the 
objectives being pursued by individual governments.'* 


_ ‘*It does, however, depend on the assumption that governments do not directly 
intervene into housing markets (e.g., via zoning) to induce individuals to consume more 
housing than they would otherwise choose. See Epple (1980) for further discussion of 
this issue. 


IMPLICATIONS OF COMPETITION 1213 


The test proposed by Oates may be interpreted as a test of equilib- 
rium condition (4). In equilibrium, with homogeneous residents, the 
maximum utility level individuals can achieve will be the same 
everywhere. Denote this utility level V. Setting V[G',Pi,(1 + t)] =V 
and solving for Pi, we obtain an equation of the form estimated by 
Oates: 


ie) 
Pir a naar) ak (20) 


In any given equilibrium, V is the same across jurisdictions, and hence 
it is impounded in the constant term in a regression equation. To 
empirically test this equation, it is necessary that one have obser- 
vations on at least two jurisdictions. However, the equation holds 
regardless of the number of jurisdictions. The above result and re- 
sults in our previous paper (Epple, Zelenitz, and Visscher 1978) 
indicate the crucial role of assumptions about movability of jurisdic- 
tional boundaries. In our earlier paper, we argued that housing prices 
must necessarily be equal across jurisdictional boundaries when 
boundaries can be costlessly redrawn. By that argument, the hypothe- 
sis of Edel and Sclar is correct, but Oates’s hypothesis is correct when 
boundaries are fixed exogenously.’” 

When the number of jurisdictions is large, changes by any one 
jurisdiction have no effect on V. Thus, when the number of jurisdic- 
tions is large, differentiation of (20) provides equations that correctly 
indicate the marginal change in the net-of-tax price of housing which 
will compensate residents when their jurisdiction changes its tax rate 
or service level. When the number of jurisdictions is small, a change 
by any one jurisdiction affects V. In that case, the appropriate expres- 
sion for the price derivatives cannot be deduced from equation (20), 
and, as a result, the compensating alteration in housing price caused 
by a change in tax rate or service level cannot be deduced from (20). 


V. Jurisdictions Differentiated by Size and Location 


In preceding sections we considered models in which land at all 
locations is identical and jurisdictions are identical in size. In this 


17 Note again that the test proposed by Oates is a test of equilibrium condition (4). If 
individuals are costlessly mobile among jurisdictions, if individuals have full informa- 
tion concerning jurisdictions’ alternative fiscal environments, and if jurisdictions do not 
restrict residents’ consumption bundles, then, as Oates clearly saw, equilibrium condi- 
tion (4) obtains. This is not, however, a test of Tiebout’s hypothesis. The latter is a 
normative statement, concerning the achievement of an amount of the public good, G, 
which maximizes individuals’ utilities. As is clear from the analysis in the paper, there is 
no assurance that the utility-maximizing level of G is attained for any individual. 
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section we generalize the model to permit jurisdictions to differ in 
both size and degree of desirability. We argue that our preceding 
results concerning competition hold when jurisdictions are differ- 
entiated by desirability of location. We also discuss how the presence 
of a large number of small jurisdictions in one part of the metropoli- 
tan area affects the monopoly power of a large jurisdiction in the 
same metropolitan area. Finally, we discuss the effects of permitting 
new jurisdictions to form at the urban fringe. 

Since the method of analysis required here is similar to that em- 
ployed in previous sections, our discussion is presented without 
benefit of explicit mathematical argument or proof. Technical details 
are available on request. 

To investigate the effects of differentiation of locations, we assume 
that the total land area L is divided into K subareas or locations 
indexed by k. Each subarea has amenity (or accessibility) in amount 
A,. Variable A is assumed to enter the utility function directly. This 
specification permits investigation of the effects of varying either the 
number of jurisdictions at each location or the number of distinct 
locations. 

Equilibrium conditions analogous to (2) through (7) continue to 
apply. Differences are that amenity level now appears in the utility 
and demand functions. In addition, supply and demand for housing 
must be equilibrated at each location. Comparative static results are 
derived by the same procedure used to obtain equations (9), (10), and 
(11), and governmental objectives are studied using the approach 
employed in Sections III and IV. 

The implications of increasing the number of jurisdictions offering 
a given amenity level are found to be similar to those derived in 
Section III. The presence of a large number of small jurisdictions 
offering the same amenity level prevents any one of these jurisdic- 
tions from exploiting the elasticity of housing demand but not the 
elasticity of housing supply. The profit-maximizing tax rate for these 
Jurisdictions converges to the inverse of the housing-supply elasticity, 
a result analogous to that in (18). Tax revenue per unit of land will be 
higher in locations with a high A, because the price of housing will be 
higher at such locations. This is a consequence of the equilibrium 
condition requiring utility to be constant across Jurisdictions. 

The presence of a large number of small jurisdictions in one part of 
the metropolitan area, however, does not prevent a large jurisdiction 
in another part of the metropolitan area from exploiting both the 
elasticity of housing demand and supply—a result analogous to that 
in (15). 

To interpret these results, think of two locations in a metropolitan 
region, one the central city and the other the suburban area. The 
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above results imply that a large number of jurisdictions in the subur- 
ban region are sufficient to prevent individual suburbs from exploit- 
ing the elasticity of housing demand. But the relatively large propor- 
tion of metropolitan land area occupied by the central city enables its 
bureaucratic monopolist to exploit the elasticity of housing demand, 
even if there are a large number of jurisdictions in the suburban 
region. 

Instead of hypothesizing an increase in the number of jurisdictions 
offering a given amenity level, we might argue that each jurisdiction 
encompasses land with unique attributes. The implications of in- 
creasing the number of distinct jurisdictions occupying a fixed land 
area are found to be the same as those derived in Section III. The 
presence of a large number of small jurisdictions, even if each offers a 
unique amenity, is sufficient to prevent any jurisdiction from 
exploiting the elasticity of housing demand but not the elasticity of 
housing supply. 

Finally, it might be argued that our results in Section III arise 
because we assumed a fixed amount of land available for urban use. 
This conjecture is incorrect. To demonstrate this point, suppose that 
locations are indexed in order of decreasing desirability, so that juris- 
diction K has the lowest amenity level of the existing jurisdictions. 
Land with amenity level 4,,, may then be thought of as the urban 
fringe, which we assume to be in perfectly elastic supply. With these 
assumptions, the utility that can be achieved by choosing land with 
amenity level A,,, is constant at, say, U. 

There are two possible equilibria. All individuals may find the 
existing K jurisdictions more attractive than the urban fringe. In that 
case, the urban fringe is uninhabited and all results are unchanged. 
Alternatively, some individuals may live on the urban fringe. Since all 
individuals must achieve the same level of utility in equilibrium, the 
utility level throughout the metropolitan area is U. However, as dem- 
onstrated in Sections III and IV, an entrenched government that 
must provide its residents an exogenously specified level of utility can 
sull profit by exploiting the elasticity of housing supply. The existence 
of jurisdictions at the extensive margin does not eliminate land rents 
in inframarginal jurisdictions, nor does it prevent the taxation of 
those rents. Therefore, regardless of which outcome occurs, jurisdic- 
tional competition, even when residents can relocate on the urban 
fringe, cannot prevent an entrenched government from exercising 
monopoly power. 

In summary, by the introduction of heterogeneous urban locations, 
we demonstrate new results and establish that the results of Sections 
III and IV hold in a much more general setting. Whether the focus be 
on one urbanized area in isolation or on all urbanized areas within a 
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nation, whenever rents exist, jurisdictional competition does not pre- 
vent an entrenched government from expropriating part of those 
rents. 


VI. Conclusions 


The major finding of this paper is that competition among many local 
jurisdictions is not sufficient to prevent local governments from exer- 
cising monopoly power. While we have derived this finding in a model 
in which government revenues are raised via property taxes, the 
result also follows if communities are empowered to levy a sales tax 
(i.e., a tax on the numeraire commodity), an income tax, or a combi- 
nation of these three taxes. 

The feature of our model that is primarily responsible for this 
result is the assumption of fixed jurisdictional boundaries.'? A gov- 
ernment given taxing powers in a jurisdiction with fixed boundaries 
can exploit the immobility of land and share in the rents accruing to 
that land. Mobility of residents across a large number of jurisdictions 
can prevent individual governments from exploiting the elasticity of 
housing demand but not the elasticity of housing supply. 

These results vindicate researchers seeking to develop a positive 
political theory of local government behavior. Jurisdictional competi- 
tion does not predetermine the outcome; Tiebout does need politics. 
On the other hand, these results highlight the importance of consid- 
ering the economic as well as the political environment of local gov- 
ernments. As shown in Section IV, competition among a large 
number of jurisdictions establishes constraints on the choices made by 
individual governments. Such constraints should not be ignored in 
modeling decision making by local governments. 
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Monetary Expansion and Real Exchange Rate 
Dynamics 





Nissan Liviatan 


Hebrew University of Jerusalem 


This paper examines the effect of a change in the rate of monetary 
expansion on the dynamics of the real exchange rate under fully 
flexible domestic prices and a floating exchange rate. The model is 
similar in structure to an earlier paper by Calvo and Rodriguez, the 
main difference being that in our model consumers have an infinite 
planning horizon and long-run perfect foresight. One of the main 
findings is that an increase in the rate of monetary expansion has the 
impact effect of creating a real appreciation of the exchange rate and 
a deterioration of the balance of payments, contrary to the results of 
Calvo and Rodriguez. 


I. Introduction 


This paper analyzes the effects of increasing the rate of expansion of 
nominal money, pu, on the real exchange rate, ¢, defined as the ratio of 
the nominal exchange rate, E, to the price of home goods P,, for an 
economy operating under a floating exchange rate and fully flexible 
price level for home goods. 

This issue is analyzed in a recent paper by Calvo and Rodriguez 
(1977) (hereafter CR) in the context of a small two-sector economy 
(consisting of traded goods and home goods) whose domestic citizens 
hold both domestic and foreign money, which are the only assets. 
Their conclusion is that an increase in will increase e, that is, cause a 
real devaluation. Consequently, the impact effect of an increase in 
monetary expansion is to induce an increase in the production of 


I am indebted to Jacob Frenkel for pointing out an error in an earlier version of the 
paper and for various comments which helped to improve the exposition. 
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traded goods and to cause an improvement in the balance of pay- 
ments. 

In the present paper we reexamine the problem within the 
framework of utility-maximizing behavior and long-run perfect 
foresight using an approach similar to that of Sidrauski (1967) and 
Brock (1974). It is shown that a central property of the Calvo- 
Rodriguez model, namely, the long-run constancy of total assets, no 
longer holds in the present model. As a result, the impact effect of uw 
tends to result, under our assumptions, in a real appreciation of the 
exchange rate, contrary to the CR result. In the final section of the 
paper the two models are compared in some detail. 


II. The CR Model and Results 


It will be useful to start with a brief description of the CR model which 
deals with two commodities—traded goods (T) and home goods (#). 
The demand functions are 

Cr = Cr(e,a); Cy = Cy(e,a), (1) 


where e = (£/P) is the real exchange rate and a denotes real assets, so 
that 


M 


a (2) 


a=m+ef;m= 


where M is nominal domestic money and / is the stock of foreign 
currency held by domestic citizens. In a full-employment economy 
output depends only on e, so that 


gr = Qrle); du = Qule). @) 


Since the home-goods market is always cleared as a result of flexible 
prices, we have 


Cy(e,a) = Qule). (4) 


By using (1) and (3) we can also express the balance of payments 
F =f = Qrle) ~ Crle,a) (5) 


as a function of e and a. 
Note that by (2) and (4) we can express a as a function of e,a = Wee), 
W’ < 0, so that using (5) we obtain 


f= 6@).. 6 > 0 (6) 


by the usual assumptions. The fundamental feature of this relation- 
ship, from the point of view of the subsequent analysis, is that it holds 
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independently of uw. This implies, of course, that the steady-state 
value of e, derived from ¢(e) = 0, is also independent of uw and hence a 
constant. 

The basic relationship of the assets market is 


eu 0 


where E = (dE/dt)/E (we shall use X to denote [dX/dt]/X), and L’ < 0. 
Since E =é+ 7 (7 = Py) and m = w — a, we can express (7) as 


m : 
j= amEey 8 
é L{ 7) aie (8) 
However, since a and e are functionally related, we can express m asa 
function of e and f. By virtue of (6) and (8) this implies that é can be 
expressed as a function of e, f, and pw, say 


e= ne, fH), (9) 


where yw is dynamically constant. The assumption of perfect foresight 
permits identification of “actual” and “expected” rates of change. 

Equations (6) and (9) form a dynamic system in e and f with w asa 
parameter. As is usual in monetary-growth models with consistent 
expectations, the dynamic system around the steady state corresponds 
to a “saddle point” solution. Along the saddle path of the linearized 
system f follows a linear first-order differential equation, which in 
turn implies that f converges monotonically to the steady-state solu- 
tion. 

Suppose we start at a steady state and increase pu. Let us assume that 
this causes an increase in the steady-state value of f (denoted f). We 
know, however, that in the course of adjustment to the new equilib- 
rium f must increase monotonically at a diminishing rate, as it should 
be for a linear differential equation.! Hence if f, (the initial value of 
[df/dt]) was zero before the change in yw it must become positive as w 
and f increase. However, since he and e are functionally related by (6), 
the increase inf, must be associated with an increase in €4, Which is the 
result proved by CR. In economic terms, in order to support the 
accumulation of f through the balance of payments we need an initial 
real devaluation. This argument can be summarized by 





d 


sign aoe sign 
du 


af apes OFF 
= ee 10 
i sign du ( ) 


_ | The equation for f is one of the form f(t) = f + be, X < 0, and b = fo — f. Hence 
J = Nbe™ and f, = db. After the increase in we have /, = AG — fi) > 0 since A < 0 
and f,; > fo. 


MONETARY EXPANSION 1221 


Since the steady-state value of ¢ is independent of uw and since there 
is a unique relationship between a and e that is consistent with home- 
goods equilibrium, it follows that the steady-state value of assets is also 
independent of the rate of monetary expansion. The higher steady- 
state rate of depreciation that is associated with the higher pw implies 
that in the new steady state a higher proportion of the portfolio will be 
held in the form of foreign exchange so that f is raised. Consequently, 
fi. and e, increase. 

The constancy of the total value of assets across steady states is 
central to the CR thesis. It may be noted, however, that an increase in 
@ represents in some sense a reduction in the return on assets in 
general, which may lead in a different model to reduction in their full 
equilibrium value. In order to focus attention on this aspect we shall 
formulate a long-run perfect-foresight model where the relation be- 
tween the rate of return on assets will have to be balanced against the 
subjective rate of time preference, as in optimal growth models. 


Ill. The Long-Run Perfect-Foresight (LRPF) Model 


We assume that the supply side is the same as in the CR model. The 
difference is in the treatment of demand for commodities and assets. 
We wish to base the consumers’ behavior on utility analysis. In order 
to account for differential demand for m and f we shall follow the 
procedure of including the liquidity services provided by both assets 
in the utility function (in this respect, we extend the Sidrauski [1967] 
approach to two assets). 

An important assumption that we shall use is that the instantaneous 
utility function is separable? as far as consumption of ordinary com- 
modities and liquidity services is concerned. It is this assumption 
which will enable us eventually to derive a relationship between Vs ande 
which is independent of yw, as in (6). The utility integral to be 
maximized by the representative consumer can then be written as? 


J : eTu(cycr) + V(m,ef) dt, (11) 


where wu and V are strictly concave functions and 6 > 0. 

As in CR, there are only two assets—m and /. There are no bonds 
and hence no borrowing. The only way the consumer can spend in 
excess of his income is by running down his stocks of m and f (“bor- 


2 The separability in (11) is stronger than necessary. What is required is that the 
marginal rate of substitution between C7 and Cy in u(-) be independent of the assets 
(i.e., separability in the Leontief sense). ; 

3V(-) can be expressed more generally as V(m,f,e) without affecting our results. 
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rowing from himself”), so that there is an upper bound on his current 
consumption possibilities. This assumption is, in fact, required for the 
existence of a solution. 

The income receipts consist of the value of output q = gy + egr and 
of transfer payments by the government s. We may then express the 
accumulation of assets (a) as 


a@=q+s—IIm - (Il — E)ef — (cy + ecr), (12) 


where [Im and (II — E )ef represent “depreciation” terms. Clearly, when 
(E — II) is positive we may regard (E - IT)ef, or (ef), as an “income” 
term. In fact, in the present model é often plays a similar role to the 
“real rate of return on capital” in monetary-growth models. Using the 
definition a = m + ef we may rewrite (12) as 


@=g+s+ea—(e + i)m— (cy + ce), (13) 


where é represents the “real interest rate” and (é + I) represents the 
nominal one. 

The optimization problem is then to maximize (11) subject to (2), 
(13), and initial assets (a,). It should be noted that from the point of 
view of the maximizing individual e is a market parameter and the two 
assets are fully interchangeable at any instant. From the Euler condi- 
tions we obtain 








Ym —Ve _ 411, (14) 
Un 
Ur _ 
juan (15) 
Ste ee ee (16) 
Un Un 


where subscripts of u and V denote taking partial derivatives with 
respect to the indicated variables (e.g., up = [Ou/Ocr], Vp = [AV/0 (ef)], 
etc.). 

We may then rewrite (14) as 


eile nee oy 
Uy 

which means that along the optimal path the real rate of return on F 
must equal the real rate of return on m (i.., —II) plus a liquidity 
premium. Condition (16) is the usual condition of optimal growth 
models where the role of the “real rate of return on capital” is now 
taken by [(Vp/uy) + é], which equals the real market rate of return on 
F plus the nonpecuniary liquidity element. 
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Let us examine first whether in the present model we may still 
derive an autonomous relationship between f and e. Since in equilib- 
rium Cy = Q,y(e), we can write (15) as 


urlQu(e),cr] a 
lO cle ee) 


which can be solved to yield cp = C7(e). From the balance of payments 
we then have 


f = Qrle) — Cre) = Vee), (18) 


which proves that we still have a functional relationship between f and 
e which is independent of wp. 

The sign of [dC7(e)/de] is generally ambiguous. If, however, we take 
the example of a Cobb-Douglas type of utility function, then (up/uyz) = 
B(cy/cr) (8 a constant), and hence by (18) cr = [BQy(e)/e] so that 
[dC p(e)/de] < 0. The same is true for any homothetic u(-) function. In 
any case, whatever the sign of the latter derivative, we may assume 
that the sign of V’ in (18) is dominated by Q7(e), so that VW’ > 0 (clearly 
if [dC ;/de] < 0 this tendency is only strengthened). Thus the prop- 
erties of V(e) are basically the same as those of (e). 

Since in equilibrium cz can be expressed as a function of e alone and 
since the same is true of cy, it follows that uy and uy are, in equilib- 
rium, functions of e only, say uz(e) and u7(e). This implies that (16) can 
be expressed as 


é = 6(e,m,f). (19) 


By a similar consideration, and using II = w — (m/m), we can express 
(14) as G(m,m,é,e,f,u) = 0. Substituting (19) in the foregoing relation- 
ship and solving for m we obtain 


m = y (emf). (20) 


Equations (18), (19), and (20) form the dynamic system for e, m, and f 
for the economy under competitive equilibrium. 

When we consider the system at some “initial” point of time, then f, 
is momentarily fixed. However, the initial values e, and m, are free to 
be determined by the market since all prices are flexible. We assume 
that in the LRPF (long-run perfect-foresight) model e, and m, are 
determined uniquely so as to set the economy on the “saddle path.” 

In terms of the linearized system around the steady state, we can 
express the solution for any of our variables, say for f, in the form 


ft) =f + beh + dyed! + by, (21) 


where the b’s are determined by the parameters of the system and by 
the initial values, f,, m,, and e,. The fact that e, and m, are determined 
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so as to set the system on the stable trajectory leading to the steady 
state means that two of the three )’s are positive (or have positive real 
parts in the case of complex roots), and therefore their corresponding 
b;’s have to be set equal to zero by appropriate values of e, and m. 
Furthermore, the fact that the “saddle path” is stable means that the 
remaining \ (say A,) is negative. Since on the saddle path 


f(t) =f + bem, (22) 


we see that b, is determined by (0) — f. This leads to the fundamental 
conclusion that the convergence of f(¢) is monotonic, as in the CR 
model. 

We turn now to consider the effect of an increase in pu on the real 
exchange rate (starting from a steady-state position). Since in a steady 
state f = 0, we see immediately from (18) that é is uniquely deter- 
mined, since VY’ > 0. This corresponds, of course, to the slope-of- 
transformation curve between H and T at the point of tangency with 
the indifference curves relating to the two commodities. Thus bt has 
no effect on the long-run real exchange rate. 

In order to analyze the impact effect of 4 on e, we make use again of 
the fact that f = V(e) is independent of yw, and of the property of 
monotonic convergence of f. As we saw in connection with the CR 
model, this implies sign (de/du) = sign (df,/du) = sign (dfidu). Let us 
then determine (df/d), which is the key to the impact effect. The sign 
of this derivative depends critically on the sign of the mixed derivative 
Vmr. The usual assumption in these cases is that V,,7 > 0; that is, the 
two assets behave like “cooperant” factors of production in the theory 
of the firm. 

In steady states é is a given constant, while conditions (14) and (16) 
reduce to 


V i Vp — , 
ee (14’) 
a 
a (16’) 


Differentiating this system with respect to m, f, and yw we obtain the 
following solutions: 


dm _ Vawty, df __ Vim uz 
dp Aw? - dee Agwe (23) 


A= VinnV pr at View: 


Since V pr < 0 and A > 0 by the concavity of V(-) we find that (dm/dp) 
< 0 as expected. We also find that under the “cooperancy” assump- 
tion (df/dw) < 0. Since this refers tof of steady states, it follows from 
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the earlier remarks that (df,/du) and (de,/du) are negative as well. 
Thus an increase in pw leads to an appreciation of the real exchange 
rate contrary to the CR result. 

These results are illustrated graphically in figure 1, where SS rep- 
resents the relationship between f and f along the saddle path, while 
EE represents f = W(e). Note that e increases from right to left on 
the horizontal axis. As u increases we have a new f(f,) and a new S'S’ 
curve passing through it. However, since momentarily f = f,, we 
have f, decreasing from zero to fj. This corresponds on the EE curve 
to a decrease in é, from é@ to e3. The dynamic paths are represented 
by movements along AB and CD. Thus along the dynamic adjust- 
ment path we have real depreciation associated with a negative trade 
balance, but e is still below é throughout. 


IV. A Comparison of the Alternative Models 


The key to understanding the difference in the results obtained in the 
two models is in the different effect of 4 on the steady-state value of f. 
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The negative effect of 4 onf in the LRPF model and its implications 
can be explained as follows. 

In steady states the real rate of return on F, that is, é, is identically 
zero, while the real rate of return on m is given by —. Hence an 
increase in w& reduces the real rate of return on money and therefore 
reduces m. By the cooperancy assumption this reduces the marginal 
utility of f, and consequently its equilibrium value is reduced. Under 
long-run perfect foresight the decumulation of f must start im- 
mediately so that fe decreases. However, a deterioration of the balance 
of trade is compatible only with an appreciation of the real exchange 
rate. Hence (de,/du) < 0. 

Since the cooperancy assumption is crucial for our result, we must 
discuss it in greater detail and interpret it, for the sake of comparison, 
in the context of the CR model. 

Note that (V,/uy) and (Vp/uy) can be interpreted as the subjective 
rates of return on m and F, which will be denoted by pm and pr, 
respectively. Condition (14’) can then be written as Pm — 7 = pr(M= 
7), which indicates that the py must equal p,, after allowing for the 
inflationary depreciation of real balances. Thus we have essentially a 
condition of equality of the “net” rates of return properly defined. It 
can therefore be regarded as a portfolio allocation condition analo- 
gous to condition (7) in the CR model. 

Indeed, in (14’) we have a relationship between m and F in steady 
states that must be consistent with wu (= E), as is the case in (7). Lf we 
accept this analogy between (14') and (7) it seems reasonable to 
require that for a given wu, m and F should be positively related in 
(14’). Writing (14') as 


V n(m,F) 
Uy 


-8=y, (14”) 


we see immediately that a positive relationship between m and F 
implies View > 0; that is, the factors must be cooperant. Thus the 
cooperancy assumption seems to be in line with CR’s equation (7). 

While (14’) relates the rates of return to each other, (16’), which can 
be written as pp(m,F) = 6, pins down the rate of return on F to the 
given subjective rate of time preference. Thus (16') helps to deter- 
mine the “scale” of assets. As / increases and m decreases, we have to 
determine what happens to the value of F which is required to main- 
tain py = 6. In this connection we note that if m and F are positively 
related in (14”), this must be the case for (16’) as well (since Vinz > 0 
and V is concave). 

Note that while (14’) plays a similar role to that of (7) in the CR 
model, condition (16') is the one which replaces their assumption of 
the constancy of m +:F across steady states. In the present model the 
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constancy of the value of total assets is replaced by the condition of 
constancy of the net rate of return, that is,6 = pr = p» — IU, which has 
quite different implications. 

The difference between the models can now be clarified by means 
of figure 2. Suppose that the 4A curve represents the portfolio bal- 
ance condition given by (7) in the CR model and by (14”) in the LRPF 
model. To close the model, Calvo and Rodriguez introduce the con- 
dition that m + F is constant as represented by the HH curve. 

In the LRPF model the system is closed in a different manner, 
namely, by the condition py = 4, that is, by (16’) as expressed by BB. 
The latter curve must be upward sloping since the assets are cooper- 
ant, and it is steeper than AA as a result of the concavity of V(-). The 
intersection of AA and BB represents the solution corresponding to 
(14') and (16’). An increase in w will then shift the long-run equilib- 
rium to Q’ in the CR model (where F increases) and to Q" in the 
alternative one (where F decreases). Unlike the CR model, where total 
assets (m + F) are constant, we must have a reduction of assets in the 
alternative LRPF model. 
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Economies of Scale and Barriers to Entry 





Richard Schmalensee 


Massachusetts Institute of Technology 


Dixit has recently presented a model in which established firms select 
capacity to discourage entry but cannot employ threats they would 
not rationally execute after entry. Entry deterrence in a slight modi- 
fication of this model involves the classic limit-price output. Under 
linear or concave demand, however, the capital cost of a firm of 
minimum efficient scale is an upper bound on the present value of 
the monopoly profit stream that can be shielded from entry. It is 
argued that this suggests the general unimportance of entry barriers 
erected by scale economies. 


In his seminal work on conditions of entry, Bain (1956) argued that 
the necessity for a firm to be large relative to the market in order to 
attain productive efficiency created a barrier to entry. The notion that 
large-scale entry can create a discrete difference between preentry 
and postentry price and profit levels is clear enough, but the details of 
Bain’s argument are still somewhat controversial. In assessing the 
importance of the scale-economies barrier, Bain introduced the 
limit-pricing model of entry deterrence, in which established firms act 
as a perfect cartel and potential entrants expect those firms to main- 
tain their preentry levels of output even after entry. This model has 
been subjected to strong criticism, however, in large part because it 
may not be rational for the established firms to keep output constant 
after large-scale entry has occurred.! Moreover, Stigler (1968) and 
others have challenged the basic idea that scale economies can create a 


I am indebted to Steve Salop, Roger Bohn, and Nalin Kulatilaka for helpful com- 
ments, though I cannot share blame for remaining defects with them. 


"Scherer (1980, chap. 8) provides a useful discussion of the limit-pricing model and 
its critics. 


[Journal of Political Economy, 1981, vol. 89, no. 6] 
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meaningful entry barrier. These critics have stressed the fact that 
once an entrant has invested in an efficient plant, there is no dif- 
ference (under the usual assumptions) between its position and that of 
established firms. Without a postentry difference, they have argued, 
there can be no real barrier to entry. 

In part this last argument concerns the most useful definition of 
“barrier to entry.” Spence (1977), building on insights of Schelling 
(1960), has recently presented an interesting analysis of the economic 
issues involved in that argument.? The basic point that emerges is that 
established firms, assuming they can coordinate their actions, have the 
advantage of being able to make some irreversible decisions before 
new entry appears. In particular, they can select the level of capacity 
facing any new entrant. Even if entry occurs and the established firms 
then wish to have less capacity, their preentry commitments may 
make a rapid reduction in capacity impossible. Recognizing this, en- 
trants may be deterred. In his formal analysis, Spence assumes that 
the established firms build enough capacity to produce nearly the 
competitive output. Before entry, they produce the monopoly output, 
but they threaten to use all their capacity if entry occurs. If this threat 
is believed, as Spence assumes it is, entry can clearly be deterred. Buta 
threat to increase output after entry is surely no more credible than 
the threat to maintain output that is the core of the limit-price model. 

In an important recent paper, Dixit (1980) explores the implica- 

"tions of restricting the established firms to threats that they would find 
it in their postentry interests to execute. In particular, he assumes 
that potential entrants expect the postentry market equilibrium to be 
that of Cournot doupoly. That is, they expect the established firms to 
behave rationally but noncooperatively in response to their entry. 
Though this market model is far from perfect, it at least implies 
plausible profit-seeking behavior by both established sellers and new 
entrants, and it avoids building ad hoc behavioral asymmetries into 
the postentry situation.* Dixit finds that the established firms may still 
be able profitably to deter entry if they can commit to a level of 
capacity before potential entrants appear. 

In Dixit’s simplest model, scale economies arise because of fixed 
costs. In order to relate his analysis to the earlier limit-pricing litera- 
ture and to empirical work on scale economies, it is assumed here 
instead that the long-run average cost curve is L-shaped, so that the 
minimum efficient scale of operations is well defined.4 The resultant 


? See also Dixit (1979), Spence (1979), and, for a good discussion of the basic concepts 
involved, Salop (1979). 

3JIn addition to the discussion of the Cournot assumption by Dixit (1980), see Orr 
and MacAvoy (1965), where it is argued that Cournot behavior is a plausible threatened 
reaction to cheating on a cartel agreement. 

“See, e.g., Modigliani 1958 and Scherer 1980, chaps. 4 and 8. 
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model is analyzed in the next section. It is shown there that entry 
deterrence involves charging exactly the classic limit price before 
entry and holding output constant if entry occurs. A basic difference 
between this analysis and that of Bain-Sylos, however, is that the 
threat to hold output constant after entry is not always credible. It is 
shown that if the threat is credible, and if the market inverse demand 
function is linear or concave, the preentry present value of excess 
profits that can be shielded from entry cannot exceed the capital 
(start-up) cost of a firm of minimum efficient scale. In Section IJ, it is 
argued that this upper bound and widely accepted estimates of the 
importance of scale economies imply that the entry barriers consid- 
ered here are generally of little quantitative importance. The implica- 
tions of localized competition and long-lived advertising in this con- 
text are discussed briefly. 


I. Credible Entry Deterrence 


Following most of the relevant literature, the established firms are 
assumed to be either a single enterprise or a perfect cartel and are 
referred to as “the monopoly” in what follows. Only one potential 
entrant, called “the entrant,” is considered. The monopoly is assumed 
to select and construct its capacity, K”, prior to the entrant’s appear- 
ance. Both the monopoly and the entrant operate under the cost 
conditions shown in figure 1. Economies of scale are such that firms 
with capacity less than Ky have prohibitively high costs, so that K, is 
the minimum efficient scale in this industry. For levels of output, q, 
between K, and capacity, K, variable cost is v per unit. Capacity costs c 
per unit to install. It is assumed to last forever, so that the flow cost of 
Capacity per period is (rc) per unit, where r is the relevant discount 
rate. With capacity K in place, the heavy kinked line in figure 1 is the 
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Fic. 1.—Marginal cost after acquisition of capacity K 
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Fic. 2.—The monopoly’s postentry reaction function 


effective marginal cost schedule. By selecting K”, the monopoly de- 
termines where the kink in its marginal cost function occurs. The 
entrant, with no sunk investment in capacity, faces a constant margi- 
nal cost of ¢ = rc + v for all output levels above Ko. All this follows the 
basic setup of Dixit’s (1980) simplest model, except that he has Ky = 0, 
and he allows nonzero fixed setup costs. 

The first step in the analysis is the description of the monopoly’s 
postentry Cournot reaction function. For iulustrative purposes, sup- 
pose that the industry inverse demand function is given by 


Pe PO) a 10 eget”), (1) 


where a and # are positive constants, P is price per unit, Q is total 
industry output, g” is monopoly output, and q° is the entrant's output. 
If marginal cost were some constant, X, the monopoly’s reaction 
function would be simply 


G7 = "ae bger es) 2b (2) 


Things are more complex under the cost structure assumed here. 
The heavy kinked line in figure 2 shows the monopoly’s postentry 
reaction function given installed capacity K”™ and market demand 
given by (1). 

In figure 2, the line MM’ is the reaction function of a firm choosing 
q™, taking q° as fixed, with constant marginal cost equal to ¢, while NN' 
is drawn for marginal cost v. If q° exceeds U,, the addition of Ky to 
industry output would drive price below average variable cost, v, so 
that the monopoly’s optimal output is zero even with capacity in place. 
For q® between L, and U,, the monopoly optimally sets q” = Ko. Lf 
there were no lower bound constraint on output, a smaller value of q” 
would be chosen along NN’. As q° is reduced below L,, the monopoly 
expands output along NN’ until the capacity constraint is encoun- 
tered. The horizontal section of the reaction function in figure 2 
corresponds to the vertical portion of the marginal cost schedule in 
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figure 1. If g° is sufficiently low, it may (depending on the initial choice 
of K”) be optimal for the monopoly to add capacity and expand 
output, at a marginal cost of ¢ per unit. As q° goes to zero, the 
monopoly’s optimal output goes to M, the unconstrained monopoly 
level, as figure 2 is drawn. If K” were chosen above M but below N, 
q” would be set equal to K” for g° = 0. Monopoly output never ex- 
ceeds N. 

If the monopoly had not acquired any capacity, it would not invest 
if g° were above U,, since the addition of Ky to industry output would 
drive the price below total unit cost, ¢. For q° between L, and U,, it 
would be optimal to set g” = Ko, while for smaller values of q° it would 
be optimal to acquire more than the minimum possible capacity. 
Formally, the key quantity levels discussed above are given in general 
and in the linear case of equation (1) by 


P(L; + Ko) + KyPo(Li + Ko) = t; L, = (a — 2bKy — t)/b, (3a) 


P(U, ARS) = t: U,=(a—bK,—t)/b, (3b) 
De (et lit en atl a ea L, = (a — 26K, — vib, (3c) 
PO Fea kK 5) aU U,= (a SDK gekv be (3d) 


The reaction function relevant to the entrant’s decisions is given by 
the mirror image of the schedule MAB in figure 2. With no commit- 
ment to capacity, the entrant has a constant (long-run) marginal cost 
of t for output levels above Ky. Consideration of figure 2 then makes it 
clear that the entrant will be deterred if it expects the monopoly’s 
postentry output to exceed U;. From (3b), U; is exactly the preentry 
output predicted by the classic Bain-Sylos limit-price theory, with 
P(U,) the corresponding preentry price. 

Figure 3 depicts a situation in which entry is deterred by monopoly 
choice of a capacity level, K”, above the limit quantity, U,. (Note the 
reversal of axes between fig. 2 and fig. 3.) Given its precommitment to 
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Fic. 3.—Credible entry deterrence 
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that capacity, the monopoly’s reaction function is exactly as before; as 
long as q° does not greatly exceed Ko, it is clear that the monopoly’s 
optimal response is to utilize capacity fully. The entrant’s reaction 
function, wa, is the mirror image of the ex ante monopoly schedule 
MAB. Note that with the monopoly’s capacity in place and the corre- 
sponding costs sunk, the conventional Cournot equilibrium point, e, 
has no particular significanee. The only equilibrium is gu K™ qe = 0: 
entry is deterred, and the monopoly always uses its installed capacity 
fully. 

A bit more analysis of the situation depicted in figure 3 makes it 
clear that in order for the monopoly to be able to deter entry and still 
enjoy positive economic profit, two conditions are necessary. First, ¢ 
must be positive; it must be possible for the monopoly to commit 
itself in advance to Capacity, to purchase a lower marginal cost over 
some output range. If c = 0, the MC = v reaction schedule N"N (which 
corresponds to the left-hand portion of NN’ in fig. 2) collapses to 
M'M, and the monopoly cannot avoid the Cournot point, e, which 
involves entry. Second, Ky must be positive; there must be scale 
economies as well as durable capital. If Ky = 0, the entrant’s reaction 
function extends to the point yu’, and K” must exceed p’ in order to 
deter entry. But an examination of the entrant’s reaction function 
makes it clear that P(y’) = ¢t, so that the monopoly can deter entry 
only by producing at least the competitive output in the absence of 
scale economies. (When q° = 0, the entrant’s marginal revenue is 
exactly the market price. Reference to [3b] makes it clear that U, < ee 
as fig. 3 shows.) 

A somewhat stronger necessary condition for entry deterrence with 
positive profits is that the point in figure 3 must lie inside the N’N 
locus. That is, if marginal cost is v, it must be optimal for the 
monopoly to produce at least U; in response to q° = Ky. If not, any 
entrant can plan on producing Ky and earning positive profit, re- 
gardless of the monopoly’s choice of capacity or preentry output. If 
the market marginal revenue curve slopes downward, this condition 
means that the monopoly’s marginal revenue at the point 8 must be at 
least v: 


FeO) erg ( UK, t= oe (4) 

Rearranging, noting that P(U, + Ky) =¢ from (3b), and multiplying by 
Ko, one obtains 

[—P9(U; + Ko)KolU; S cK. (5) 


If the inverse demand function is either linear or concave, weak 
concavity implies 


P(U,) S P(U; + Ko) + Pa(Ut + Ko)LUr — (Ur + Ko)]. (6) 
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Substitution from (6) for the bracketed term in (5) yields 
[B(U,) —21U c= cK: (7) 


Since U; is the monopolist’s minimum (preentry and postentry) deter- 
rence output, condition (7) says that the flow of excess profit enjoyed 
by a monopoly that has credibly deterred entry cannot exceed the 
per-period cost of capital employed in an enterprise of minimum 
efficient scale.° This inequality clearly reflects the fact that both c and 
Ky must be positive for this sort of limit pricing to be credible and 
profitable. 

It should be noted that it may not be optimal for the monopoly to 
install sufficient capacity to deter entry even if it is possible to do so. 
Depending on the details of the demand structure, the monopoly 
might prefer to allow entry and have the equilibrium occur at some 
point along eaB. It can, of course, guarantee itself any such point 
within N'N by its choice of K™. If the monopoly does decide to admit 
the entrant and the latter invests in capacity, inequality (7) serves to 
bound the total excess profit that the resultant duopoly can protect 
from entry by coordinated action.® In general, if entry is unattractive 
to outsiders, the flow of monopoly profits received by cooperating 
insiders cannot exceed the flow cost of capital assets embodied in a 
firm of minimum efficient scale, as long as industry demand is not 
strictly convex. 


II. Extensions and Implications 


The derivation of (7) rests on a number of strong assumptions that 
serve to overstate the ease of deterrence. First, itis at least arguable that 
the Cournot model overstates the intensity of rivalry to be expected in 


>In this model, there would seem to be no general way to obtain a bound on profit, 
like condition (7), without using concavity. (If demand elasticity is a constant, —E < —1, 
[4] is satisfied for all positive Ko if and only if v < rc[E — 1].) Similar bounds hold in 
some related models of entry deterrence. In the “simple illustrative formal model” of 
Schmalensee (1978, pp. 312-13), e.g., if new brands do not expand total demand, 
per-brand excess profit in deterrence equilibrium equals brand-specific fixed cost. With 
output expansion, fixed cost becomes an upper bound. See also the equilibria in the 
references cited in Schmalensee (1978, pee) 3eenee)) 

ee literal given level of capacity is divided among several firms, each will see a larger 
Cournot marginal revenue at capacity output than would a single seller. For any given 
entrant output, capacity is thus more likely to be fully utilized with several established 
firms than with a single monopolist. This means that a higher level of profits can in 
principle be protected against entry by Cournot behavior than the monopoly analysis in 
the text implies. Given the difficulty of coordinating investment policies among several 
firms, however, there seems little to be gained by a detailed analysis. It is surely 
reasonable, though not necessarily correct, to assume that monopoly entry deterrence 
involves higher preentry profits than feasible oligopoly deterrence strategies. 
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the postentry duopoly.” If the entrant is more optimistic, entry will be 
harder to deter, and the flow of excess profits that can be protected 
will be smaller than (7) allows. Second, it was assumed that the 
monopoly was able to invest in infinitely durable assets. The recent 
work of Eaton and Lipsey (1980) supports the intuitive argument that 
the strength of the monopoly’s commitment is reduced as the assets it 
purchases become less durable, since the monopoly cannot generally 
commit itself in advance to invest after entry in order to maintain 
capacity. Third, the assumption that average cost is effectively infinite 
for outputs below Ky is very unrealistic. As the presentations of Mo- 
digliani (1958) and Scherer (1980, chap. 8) show, entry deterrence in 
the Bain-Sylos limit-pricing model is more difficult the more slowly 
average Cost rises when output is reduced below K. The need to deter 
entry at small scale lowers the profitability of the limit-price point, 
which is exactly the point considered in deriving inequality (7). Fi- 
nally, the assumption that the established firms manage to coordinate 
their investment policies perfectly in the interest of entry deterrence 
is very strong indeed. Less than perfect coordination in deterrence 
would be expected to make entry more likely. 

Given the strength of these four assumptions, it would seem sensi- 
ble to use (7) as an upper bound on monopoly profit associated with 
scale-economy entry barriers even in situations where industry in- 
verse demand may be convex. To explore the implications of this 
conclusion, let us consider an industry with (price-level adjusted, net) 
assets A, earning an accounting rate of return r* on those assets. 
Assuming away measurement problems and windfalls, accounting 
profits equal normal returns to capital plus monopoly profits. Let the 
competitive rate of return, the ratio of normal profits to A, be r. It is 
then sensible to use 7 to go from stocks to flows in this industry as is 
done in (7). That condition can then be rewritten as 


ae VA At G, (7’) 
which immediately becomes 
(r* — r)/r = cKo/A. (8) 


Assuming that economies of scale are essentially exhausted at 
capacity Ko, the ratio cK,/A should be a good approximation to the 
ratio K,/Q, the ratio of minimum efficient scale (in output terms) to 
market demand. For most industries, most scholars estimate this ratio 
to be less than 0.10. (Scherer [1980, chap. 4] presents a large number 
of such estimates.) Condition (8) then implies that if an industry has a 
return on assets of, say, 10 percent, entry is not taking place, and only 


7 See the discussion of that model in Scherer (1980, chap. 5). 
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scale economies can serve to deter entry, the normal rate of return 
must be at least 9.1 percent unless scale economies are exceptionally 
important. Scale-economy entry barriers thus cannot account for a 
rate of monopoly profit of even 1 percent, surely not much above the 
standard deviation of the corresponding measurement error in most 
situations. 

An analysis of deadweight loss similarly suggests that entry barriers 
derived from scale economies are not generally important. If industry 
demand is given by (1), it is easy to show that the deadweight loss from 
limit pricing is 6(Ko)*/2. Letting demand elasticity, P/bQ, equal E, the 
ratio of deadweight loss to sales revenue becomes [(K/Q)?/2E]. This 
exceeds 0.005 only if economies of scale are of exceptional impor- 
tance or if E is less than one. 

The analysis so far, based on conventional wisdom about the im- 
portance of scale economies, supports Bain’s (1956, p- 212) conclusion 
that “economies of scale in production and distribution do not loom 
large as the basis of barriers to entry.” Two factors that might weaken 
this conclusion deserve brief mention, however. 

First, most U.S. estimates of the importance of scale economies 
assume national markets and implicitly treat products as homogene- 
ous. If transport costs are important and markets are regional, scale 
economies can become very important indeed. Scherer (1980, p. 98) 
estimates that a cement plant of minimum efficient scale would ac- 
count for about 40 percent of demand in a typical regional market, 
for instance. If products are differentiated, competition may be simi- 
larly localized in product space. As is discussed at length in Schmalen- 
see (1978), this sort of demand structure also serves to magnify the 
importance of scale economies in entry deterrence. 

Second, spending on advertising generally has an investment com- 
ponent, though there is considerable uncertainty about the typical 
durability of the corresponding asset.* The analysis of Section I above 
implies that, by itself, the longevity of advertising’s effects on demand 
does not make it possible to use advertising to protect excess profits. A 
number of authors have suggested that the use of advertising involves 
significant economies of scale, however. There is also considerable 
uncertainty about the importance of these scale effects. If it turns out 
in general or in some industry that scale economies in advertising are 
important and that advertising has relatively long-lasting effects on 
demand, the analysis above would lead one to suspect, by analogy, 


8 Comanor and Wilson (1979) survey the evidence on this point. 
® See Schmalensee 1972, chap. 7; Comanor and Wilson 1979. Spence (1980) provides 


a useful discussion of the definition of economies of scale in this context, though he 
does not treat advertising as an investment. 
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that advertising could be used like investment in plant and equipment 
to protect excess profits from entry. In order to verify this suspicion, 
however, one would at least have to verify that at any instant past 
investment in advertising has the same sort of impact on marginal 
returns to changes in output that investment in capacity does. The 
first of the models discussed in Schmalensee (1974) illustrates that 
advertising does not have such effects in at least some plausible 
dynamic models. "° Rather than attempting to reason by analogy from 
the treatment of investment in capacity here, it would seem preferable 
to construct explicit models involving economies of scale in advertis- 
ing and durability of advertising’s effects on demand and to see if 
those models allow established monopolies to deter entry by means of 
investment in advertising. Such models may even imply simple 
analogs of (7), but that remains to be seen. 
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the government coffee-support policy, which held the income of the 
export sector at a high level and hence enabled the manufacturing 
sector to expand. The fact that industrial output expanded in the 
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1930s. 
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Having presented the model in the first section, we use it as the 
basis of our discussion of industrialization in Brazil in the years of 
the Great Depression. The key to understanding the behavior of the 
Brazilian economy in the 1930s is the government coffee-support 
policy, which held the income of the export sector at a high level and 
hence enabled the manufacturing sector to expand.’ We show that 
stockpiling expanded income, and that, when stockpiling was 
financed by export duties, the trade balance improved, but when it 
was financed by credit, a trade deficit arose. The fact that the currency 
was devalued during periods in which the coffee sector was supported 
through domestic credit expansion but appreciated when coffee pur- 
chases were financed via duties strengthens the hypothesis that de- 
valuations were a response to the external disequilibrium created by 
the coffee policy. The fact that industrial output expanded in the face 
of both real depreciation and appreciation points to the importance of 
expenditures effects, relative to depreciation and price effects, in- 
duced by fiscal and monetary policies during the 1930s. 


I. The Model 


The economy has two sectors: a coffee-producing sector oriented to 
the export market and a manufacturing sector that produces goods 
for domestic consumption and investment. Imported goods compete 
with domestic manufactures on the internal market. 

There are markets for three types of goods: coffee (C), domestic 
manufactures (Q), and imported manufactures (M). The prices of 
these goods in home currency are, respectively, P,, P, and P». The 
nominal wage is denoted W. The world supply of imported man- 
ufactures is perfectly elastic at the international price P¥, so that P» = 
ePx, where e stands for the exchange rate. 

When imported manufactures are used as the numeraire, the real 


' This hypothesis is advanced by Furtado (1963). Furtado is criticized by Pelaéz 
(1972), who contends that a large share of the income generated by the multiplier 
effects of the coffee-support expenditures was canceled by the offsetting negative 
multiplier effects of the taxes levied to finance coffee purchases. Fishlow (1972) argues 
that the coffee-support program generated demand in a somewhat more complex 
manner than Furtado believes. Basing his argument on a partial equilibrium model, 
Fishlow maintains that the greater part of the export duties were passed on to the 
foreign consumer. The transfer of the duties was guaranteed by the inelasticity of 
demand and the high elasticity of supply. Our more complete model incorporates the 
mechanism described by Fishlow, while at the same time analyzing the coffee policy 
from the standpoint of its impact on real income in accord with the approach advised by 
Pelaéz and Suzigan (1976). A monetary explanation for the industrial upsurges during 
the 1930s is found in Pelaéz (1972) and Neuhaus (1973). The monetary factors ex- 
plaining the behavior of economic policy in the Brazilian experience during the De- 
pression are also incorporated in our model. 
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prices are: p, = P./Pm, p = P/P, and w = W/P.». We assume unem- 
ployment and a given nominal wage rate. 


A. The Coffee Market 


In the coffee sector, the fixed factor (suitable land planted in coffee 
and served by adequate transport) can be combined with varying 
quantities of labor to obtain the crop. The grower maximizes profits 
by equalizing the value of the marginal product and a given nominal 
wage. When the price of coffee rises, so do employment and output. 
The coffee supply is, therefore, a rising function of the price of coffee 
in wage units: 


C8 = C%(p,/w). (1) 


Coffee is exported. Foreign demand is a function of the real income 
of the coffee-importing countries, y*, and of the dollar price of coffee 
deflated by the world price of imported manufactures: 


C= OT pe y*), (2) 


where p, = (P,/eP*) and T = 1 + t, witht being the duty rate on coffee 
exports and P, and P% the home price of coffee and foreign currency 
price of manufactures. 

Part of the coffee is bought by the government rather than sold 
abroad. These purchases are financed through foreign loans, domes- 
tic credit, and export duties on coffee. The real coffee purchases on 
the part of the government are designated G. 

Equilibrium on the coffee market requires that? 


C*(p,/w) = C?(p-T, y*) + G. (3) 


Equation (3) determines the real producers’ price of coffee, p,, as a 
function of the exogenous variables:* p, = p.(y*, G, T, w); Op. /dy* 
> 0; 0p-/0G > 0; dp./0T < 0; and Op,/dw > 0. 

We also define real export earnings, X, and real disposable income 
of the coffee sector, y,, measured in terms of imported manufactures, 
respectively, as:4 


It is assumed that domestic coffee consumption is negligible. 
3 From (3): 


ie 1 1 “y*! x ' 

Pe Pea aes + Hy*y*’ + o — Hel”), (4) 
where a prime beside the variable indicates its logarithmic derivative, i.e., x’ = dx/x; 0 = 
dG/C%; €, = price elasticity of the coffee supply; 7, = absolute value of the price elasticity 
of the demand for coffee; and 7,* = income elasticity of the demand for coffee. 
Equation (4) is an approximation. It is exactly correct only if initially G = 0 or C* = C4, 

* The effect of changes in the exogenous variables on real export earnings can be 
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X= Tp-C?(Tpe, y*) = xy; Gylyw); (7) 
No= pcC*(p-lw) = Vely*, G, Le w). (8) 


Real export earnings and real disposable income of the coffee 
sector are increasing functions of foreign income, government pur- 
chases, and real wages. A rise in export taxes raises export earnings 
but reduces disposable income. 

An increase in the real income of the coffee-importing countries, 
y*, enhances external coffee demand and raises the real price of 
coffee, real export earnings, and real disposable income of the coffee 
sector. 

An increase in government coffee purchases boosts the price, and, 
since the demand for coffee is inelastic, export earnings and real 
income of the coffee sector rise. 

An increase in export duties, 7, raises the dollar price and reduces 
international demand for coffee, bringing on a decrease of the real 
price received by the grower. Since the demand for coffee is inelastic 
relative to its price, export earnings rise. But disposable income of the 
coffee sector falls, because both the quantity produced and the price 
received by the grower diminish. 

An increase in the real wage rate lessens coffee supply and pushes 
the real price of coffee up. Both export earnings and real income rise 
inasmuch as demand is inelastic. 


B. The Market for Domestic Manufactures 


The price of domestic manufactures is cost determined. The real 
price of domestic manufactures, in terms of imported manufactures, 
iS 


p = Bu, (9) 

where £B stands for the output/labor ratio. 
Manufacturing output is determined by demand, which depends 
on real domestic income, y, on its price in relation to that of imported 


similars, p, and on real balances, h. Real domestic manufacturing 
output, y,, thus is: 


studied using the following equation: 
es = Ne , , Sr Ee , 
Xu eee lew Sten Oi tcts €.. ) ai eee G (5) 


The aes of changes in the exogenous variables on the income of the coffee sector is 
given by: 


are, 


€o + Ne 


€o + Ne 


Jems’ +o — nF") + JQ. = nodw". (6) 
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Real income is defined as: 


cma) a (11) 


Equations (9), (10), and (11) allow us to determine internal equilib- 
rium and to express real income as a function of the income of the 
coffee sector, real balances, and the real wage.° Given the money 
stock, the fiscal policy, and the money wage, we can depict internal 
equilibrium by an inverse relationship between real income and the 
exchange rate, as illustrated in figure 1 by schedule yy.6 Assume an 
initial combination of the real income and the exchange rate for 
which there is internal equilibrium. An increase in the exchange rate 
reduces the real wage, real balances, and income of the coffee sector. 
Both the reduction in real balances and the reduction in the income of 
the coffee sector contribute to reducing the demand for domestic 
goods. However, the reduction in real wages, and thus in the price of 
domestic goods in relation to imports, increases demand for domestic 
manufactures in substitution for the imported goods. As long as the 
two first effects dominate, a depreciation of the exchange rate reduces 
demand for domestic manufactures. Consequently, industrial output 


° Assuming that, initially,G = ¢ = 0 and X = M, where M stands for imports, we have: 

Ye = X = M and, from eq. (11), 
dylIM = yo + (YqlM) ya (12) 

From eq. (10), 

ya = (1 ~ q)p" + q(dylya) + Aah’, (13) 
where: 7, = absolute value of the price elasticity of the demand for domestic manufac- 
tures; g = marginal propensity to consume domestic manufactures; and \, = elasticity 
of demand for domestic manufactures in relation to real balance. Substituting (13) into 


(12) and observing, from the budget constraint, that (y,/M)(1 — q) = Nm, where 1» is 
the price elasticity of the demand for imported manufactures, we obtain: 


dy/M = [1/(1 — g) lye — mp! + Orgh’), (14) 


where © = y,/M. 

§ Note that h’ = p' = w = —e’, for given nominal money stock and nominal rate. For 
given policy, we also have: y. = ¢-w', where ¢, = (1 — m-)lec/(Ec + Ne)]. We can thus 
rewrite (14) as e’/(dy/M) = —(s + m)/(he + Odq — Nm), Wheres = the marginal propensity 
to save. The schedule yy is downward sloping as long as (¢, + OA,q) > Nn. 
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falls, and real income is reduced. It is implied that a depreciation 1s 
contractionary when the nominal money supply is held constant. 


C. The Balance of Payments 


Imports of manufactures, M, are a function of their price relative to 
that of domestic similar products, of real income, and of real cash 
balances: 


M = M(p,y,h). (15) 


The balance of payments measured in terms of imported manufac- 
tures is 


B=X-M+#F, (16) 


where F stands for independent movements of foreign capital ex- 
pressed in terms of imported goods. : 

External equilibrium, B = 0, can be represented by a positive 
relationship between the exchange rate and real income, as illustrated 
by schedule BB in figure 1.7 If real income increases, the demand for 
imports rises, generating a trade deficit. To restore equilibrium, the 
exchange rate has to depreciate. For the depreciation to improve the 
balance of payments, the reduction in coffee-export revenues has to be 
smaller than the reduction in import spending, obtained through the 
substitution effect, and the reduction in cash balances, induced by the 
depreciation. 

As illustrated in figure 1, internal and external equilibrium prevails 
at E, where yy and BB cross. We shall now illustrate the model by 
presenting some historically relevant comparative static exercises. 


D. Coffee Purchases Financed by Export Duties 


Assuming that G = ¢ = 0, at the outset, the result of the government’s 
increasing expenditures while maintaining a balanced budget (dG = 
dtC”) is as follows. 

Since the demand for coffee is inelastic, the initial impact of gov- 
ernment outlays financed by export duties is to raise coffee prices and 
export earnings,® creating a trade surplus and shifting BB to the 
right,® as in figure 2. On the other hand, as the income of the coffee 


* To obtain the slope of BB, we assume B = 0, X = M, and F = 0, at the outset. We 
thus differentiate (16) totally to obtain e'/(dy/M) = mI(nm + Xm — b¢): m = the marginal 
propensity to spend in imported manufactures; and Xm = the elasticity of demand 
for imports in relation to real balances. 

* Observe that dx = dy, = {(1 + e,)[(1 — Ne)(Ec + Ne) ]}P.dG. 

* The schedule BB shifts to the right in the following proportion: dy = (1/m)dX. 
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sector rises, the demand for manufactures is promoted, and industrial 
production expands (yy shifts to the right).’° At the higher income 
level, there still is a trade surplus."! The higher the marginal propen- 
sity to save, the larger is the balance-of-payments surplus, because less 
expenditures on foreign manufactures will be induced by the higher 
domestic income. The exchange rate appreciates to correct for the 
external surplus. 

We thus conclude that coffee purchases financed by export duties 
are expansionary and induced an appreciation of the exchange rate. 


E. Coffee Purchases Financed by Foreign Loans 


Next, we consider coffee purchases financed by foreign loans. As in 
the preceding case, increases occur in the price of coffee, in export 
earnings, and in income of the coffee sector. Likewise, industrial 
output expands, and domestic income grows. With the increase in 
imports, a trade deficit lower than the government deficit arises. Since 
the government deficit is offset by foreign loans, the balance of 
payments improves. Coffee purchases financed by foreign loans are, 
thus, expansionary and induce an appreciation of the exchange rate, 
ceteris paribus.” 


F. Coffee Purchases Financed by Printing Money 


Consider, now, coffee purchases financed by printing money (p.dG = 
dh). As already seen, the increase in government expenditures on 


© The schedule yy shifts to the right in the following proportion: dy = [1/(s + m)]dy,. 

" At the going exchange rate, the trade surplus can be measured by dB = [s/(s + 
m)]dy,. 

z Open that, when coffee purchases are financed by foreign loans, dX = [(1 — 
Ne)(Ec + Ne)]dG and dy, = [(1 + €-)/(E- + He) ]dG. In terms of our figures, yy shifts to the 
right in the following proportion, dy = [1/(s + m)]dy,; and BB shifts to the right in the 
following proportion, dy = {(1/m)[(1 + €-)/(€e + 1e)]}dF = (1/m)dyg. 
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coffee will raise the price of the product, export earnings, and the 
income of the coffee sector. Observe that the increase in the income of 
the coffee sector, for an increase in government expenditures on 
coffee, is larger than the increase in export revenues since it also 
grows by the government deficit. 

The increase in money balances induces more spending on imports. 
If this effect exceeds the increase in export revenues, even at the 
initial income level, a trade deficit appears, shifting BB to the left,1* as 
shown in figure 3. On the other hand, at the going exchange rate, the 
increases in real money balances and in the income of the coffee 
sector contribute to creating excess demand for domestic goods. Ac- 
cordingly, the equilibrium output level of the manufacturing sector is 
enhanced, and real income grows.'* 

At the going exchange rate, a balance-of-payments deficit appears. 
‘The external disequilibrium is corrected as the exchange rate depre- 
ciates: Domestic goods substitute for imports, and the reduction in 
real balances reduces demand for both domestic and imported goods. 

We thus conclude that financing coffee purchases through credit or 
duties has far different effects for the balance of payments. The 
government can also finance its coffee purchases through a combina- 
tion of duties, credit, and foreign loans. The final effect will certainly 
be an expansion of income, but the effect on the exchange rate then 
depends on the shares of credit, export duties, and foreign loans in 
the total government expenditures on coffee, and on the savings 
propensity. 


II. The 1930s: An Application 


Figure 4 exhibits the Brazilian industrial output expansion during the 
1930s in the face of alternating real depreciation and appreciation. 
The model developed in the previous section asserts that the coffee- 


cS ailie shift in BB can be measured by dy = {(I/m)[(1 — .)(€c + im) — Am(M/h)]}dh. 
'‘* The shift in yy is measured by dy = {[1/(s + m)][(1 + €.)(e. + Ne) + (yql/h)dg]}dh. 
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1929 31 33 35 37 1939 


Fic. 4.—Real industrial output and the real exchange rate, Brazil, 1929-39. Source: 
table 2. 


TABLE 1 


RATES OF DEPRECIATION IN BRAZIL: 1928-39 











Period (Cie Cn Co 
1928-31 .68 
1931-33 = 
1933-35 roy 
1935-37 = {07/ 
1937-39 19 





Source.—Table 3. 


support policy pursued during the 1930s in Brazil accounts for the 
industrial output expansion observed in the period. It also follows 
from the model that the observed appreciation rates, shown in table 1, 
can be attributed to the fact that, in the periods 1931-33 and 
1935-37, stockpiling was mainly financed by export duties, and that 
the observed depreciations, in the years 1933-34 and 1937-39, can be 
attributed to the fact that, during those periods, coffee purchases 
were mainly financed by domestic credit. We further explore these 
facts in this section. 

In the course of the 1920s, the coffee-support program proved 
quite effective. By purchasing unsalable stocks, the government held 
the international coffee price at high levels. Because of this artificial 
stimulus, coffee production nearly doubled from 1925 to 1929 at the 
same time that export remained approximately constant. In 1927-29, 
it was possible to export only two-thirds of the harvest.!° 

When the crisis erupted in 1929, both the federal government and 
the state of Sao Paulo abandoned their coffee-support programs 
because the possibility of external financing had vanished, and the 
Instituto Paulista do Café was in financial straits.1° Between 1928 
and 1930, the coffee price index in cruzeiros fell from 100 to 63.7.0" 


See Furtado 1963, p. 198. 
16 See Silber 1977, p. 189. 
17 See Malan and Bonelli 1977, table 1.2. 
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These facts can be fitted into the model presented in Section I. With 
the relinquishment of the support policy and the drop in the coffee 
price, both export earnings and domestic income decrease. This, in 
turn, lowers industrial output (the industrial production index fell 
from 58 in 1928 to 52 in 1930 [base = 1939])!8 as well as causing 
expenditures on manufactured imports to decrease more than export 
earnings. Even so, the trade surplus’* is outweighed, and the govern- 
ment is forced to devalue the exchange rate in order to deal with the 
balance-of-payments deficit. 

In mid-1930, the Sao Paulo State government managed to arrange 
a foreign loan for the purpose of guaranteeing coffee prices,” though 
at this point the mechanism must be qualified because the year 1930 
was marked by serious political disturbances and a withdrawal of 
capital that contributed to the deterioration of the balance of pay- 
ments as well as to the devaluation of the exchange rate. By assuring 
minimum purchase prices that were profitable for the majority of the 
coffee growers, the program actually maintained the employment 
level of the export economy and, indirectly, of the production sectors 
linked to the domestic market. Thanks to the coffee policy, industrial 
production began to recover from its 1930 bottom. 

In May 1931, the Conselho Nacional do Café was established to 
support the coffee sector by purchasing and destroying stocks. The 
purchases were financed through duties levied on coffee exports and 
funds obtained from the Banco do Brasil and the National Treasury. 
In 1931 and 1932, the coffee bought by the government accounted 
for roughly 30 percent of the export revenues. Between May 1931 
and February 1933, 65 percent of these expenditures were financed 
by duties and the remaining 35 percent by credit from the Banco do 
Brasil and the National Treasury.”! 

As the income of the coffee sector recovered from its lowest levels 
thanks to the price-support policy, the increasing demand for man- 
ufactures led to a rise in industrial output. Since government expen- 
ditures were in part financed by duties, higher income levels were 
compatible with the improvement in the balance of trade and the 
appreciation of the exchange rate. 

Between February 1933 and December 1934, the shares of the 
financial sources changed considerably, with domestic credit coming 
to assume a more important role. To better understand this policy, 
one might glance at figure 3, where the manufacturing sector ex- 
pands in the presence of exchange depreciation. 


8 See Fishlow 1972, table A-1. 


'? Between 1928 and 1930, export earnings fell from US$473.4 million to US$3 19.4 
million, and imports fell from US$383.3 million to US$225.1 million. 

20 See Silber 1977, p. 190. 

A Niexel.,. ja, NER. 
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1929 31 33 35 37 1939 


Fic. 5.—Real income and the real money stock, Brazil, 1929-39. Source: table 2. 


TABLE 2 


REAL Output, REAL MONEY, AND REAL EXCHANGE RATE INDICES FOR BRAZIL: 1928-39 











Index of 
Index of Index of Import Prices 
Real Industrial Index of Money Balances Deflated by 
Output Real Output _—Deflated by the Industrial Prices 
Q y GDP Deflator eP*/P 

Years (1) (2) (3) (4) 
1928 58 65.2 51.5 67.2 
1929 56 65.9 D2) 69.9 
1930 52 64.5 54.1 91.7 
1931 55 62.4 64.8 99.7 
1932 56 65.1 72.1 86.2 
1933 61 70.9 81.2 83.0 
1934 68 77.4 80.2 84.0 
1935 77 79.7 81.8 110.6 
1936 91 89.3 88.0 110.3 
1937 93 Oo 89.2 110.0 
1938 96 97.6 101.9 105.5 
1939 100 100.0 100.0 100.0 





Sources.—Col. 1, Fishlow (1972), table A-1. Col. 2, Haddad (1977), pp. 11, 191. Cols. 3 and 4, table 3. 


TABLE 3 


Money STOCK AND PRICE INDICES FOR BRAZIL: 1928-29 








Money Stock Dollar 





Annual Exchange Import Industrial 
Average Rate Prices Prices 
H e Cbs Ie GDP Deflator 

Years (1) (2) (3) (4) (5) 
1928 5229 44.3 56.7 84.4 102.7 
1929 52.2 44.3 54.6 78.1 98.7 
1930 46.8 48.4 59.4 64.8 86.5 
1931 49.9 74.5 69.2 69.4 77.0 
1932 56.5 73.4 60.1 69.7 78.4 
1933 62.2 66.1 59.4 71.6 76.6 
1934 65.4 73.4 63.0 715.0 81.5 
1935 70.0 90.6 85.7 Vie 85.6 
1936 76.5 90.1 91.3 82.8 86.9 
1937 84.8 83.9 94.8 86.2 95.1 
1938 100.1 91.7 99.0 93.8 98.2 
1939 100.0 100.0 100.0 100.0 100.0 





Sources.—Col. 1, Pelaéz and Suzigan (1976), table A-3. Col. 2, Malan and Bonelli (1977), table 1.3. Col. 3, ibid., 
table A.V.12: this is a tariff-inclusive cruzeiro price of imports. Cols. 4 and 5, ibid., table A.V.14. 
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Between 1935 and 1937, the Departamento Nacional do Café con- 
trolled the export supply through a quota system whereby coffee 
growers were obliged to turn part of their production over to the 
government for the formation of stocks. This was similar to financing 
coffee purchases through duties. During this phase there were less 
financial resources, and the manufacturing sector expanded in the 


presence of a slight exchange appreciation. 
Figures 4 and 5 and tables 1, 2, and 3 summarize the main facts. 
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Confirmations and Contradictions 





Estimating Property Tax Capitalization: 
A Further Comment 


Raymond M. Reinhard 


In recent years, an increasingly popular area of empirical investiga- 
tion has concerned attempts to test Tiebout’s (1956) hypothesis that 
allocative efficiency in the provision of public services may be attained 
through a system of local governments.’ In his seminal contribution to 
the analysis of the Tiebout hypothesis and the capitalization of prop- 
erty taxes, Oates (1969) reasoned that, if consumers are sensitive to 
the public service benefits and property tax costs associated with home 
ownership, these factors should be reflected in housing prices. Until 
recently, most studies of the capitalization phenomenon have fol- 
lowed Oates’s specification in which housing values are regressed on a 
vector of housing characteristics, public service measures, and a tax 
price term.” 

As several analysts have noted,? however, attempts to “test” the 
Tiebout hypothesis by using an Oates-type equation must certainly 
lead to failure. This is so because, in a perfect Tiebout equilibrium, a 
marginal increase in housing value brought about by increased provi- 
sion of public services must necessarily be offset by a marginal de- 
crease in housing value caused by the added tax cost (assuming that 


I am grateful to Lee S. Friedman, Paul R. Portney, Phillip E. Vincent, A. Thomas 
King, and an anonymous referee for their helpful comments. Wallace Oates, Jon C. 
Sonstelie, and Paul R. Portney kindly made their data available to me. 

1 See, e.g., Oates 1969, 1973; Pollakowski 1973; Edel and Sclar 1974; Hamilton 
1976a, 1976b; Meadows 1976; King 1977; Rosen and Fullerton 1977; Epple, Zelenitz, 
and Visscher 1978; Linneman 1978; and Sonstelie and Portney 1980. 

2 4 recent exception is Epple, Zelenitz, and Visscher (1978). 

3 See Edel and Sclar 1974; Hamilton 19766; and Linneman 1978. 
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© 1981 by The University of Chicago. 0022-3808/81/8906-0015$01.50 


1251 


1252 JOURNAL OF POLITICAL ECONOMY 


the public services are fully financed by property taxes). Thus, as 
Linneman (1978) recently observed, “[e]conometrically, an attempt to 
estimate [an Oates-type equation] will be rendered impossible due to 
data matrix singularity in taxes and some linear combination of the 
public goods” (p. 536). 


The Contribution of King 


In a recent issue of this Journal, A. Thomas King (1977) offered two 
significant contributions to the measurement of capitalization effects. 
First, he observed that most prior studies relied on an improper 
specification of the tax price term. As King noted, “[w]hereas the 
hypothesis suggests capitalization of the tax burden, this [Oates’s] equa- 
tion suggests capitalization based upon the tax rate” (p. 426). Second, 
and more importantly, King suggested that the tax price term be 
included as part of the dependent variable. His stated concern was 
with avoiding a possible bias in the coefficient of the tax price term; a 
more cogent rationale for the alternative specification is that it avoids 
the data matrix singularity problem mentioned earlier.* 

Unfortunately, King’s suggested estimation technique is flawed by 
an error of econometrics. A reanalysis of his results, summarized 
below, shows that King’s technique, as originally presented, will lead 
to biased estimates of the true extent of capitalization. The economet- 
ric error, however, is easily remedied; as corrected, the King tech- 
nique appears to offer a significant improvement over the standard 
Oates-type equation. 


The Model 


King suggests that an equation of the following form be used to 
estumate the extent of property tax capitalization: 


V—- btV = Bo te SBiX; = Daye eye (1) 


In equation (1), V represents the market value of the property, t is the 
property tax rate, and X; and P; represent physical attributes of the 
property and measured public services, respectively. The variable b 
measures the extent to which the property tax bill, ¢V, is capitalized 
into the property value, “. . . and is constrained to a value selected 
from the interval (0.1-1.0)” (King, p. 429). King suggests that “a 
maximum likelihood estimate of the extent of capitalization can be 


*In addition, by incorporating the tax price term on the left-hand side of the 


estimating equation, the dependent variable then reflects the actual user cost of the 


residence, while the right-hand-side (independent) variables measure the services which 
flow from the house in a standard hedonic price equation framework. 
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obtained by varying 6 over the interval and reestimating, observing 
changes in R”” (ibid.). Applying this methodology to Oates’s data, 
King concludes that the result of the misspecification in the original 
Oates equations is to overestimate the true extent of property tax 
capitalization. 

Before discussing the econometric error, two points deserve brief 
mention. First, given the range of possible values suggested by King, 
the sign of b should properly be positive, not negative. Recall that, ina 
conventional specification of the capitalization equation (i.e., with btV 
on the right-hand side of the equation), the expected sign of b is 
negative, since an increase in tV should lead to a decrease in V, ceteris 
paribus. Because (—biV) has essentially been subtracted from both 
sides of the equation, the sign of b should therefore be positive. Thus, 
the dependent variable, as redefined, represents the value of the 
property which would have obtained in the absence of any property 
tax, but receiving the same level of public services as before. Obvi- 
ously, this value should be greater than V. 

Second, King’s equation suggests the capitalization of 1 year’s tax 
bill, whereas the hypothesis would suggest the capitalization of the 
present discounted value of the stream of future tax bills. Thus, the 
term btV must be adjusted to reflect the appropriate discount rate, r. 
These two corrections are incorporated into equation (2), below, 
which assumes an infinite time horizon:° 


y+ lV = p,+ XBX: + 3B,P,. (2) 


As noted, King suggests using an iterative technique based on 
observing changes in the level of R’. However, this technique cannot 
be legitimately applied to either his equation (1) or the alternative 
equation (2) without one further adjustment. The difficulty arises 
because the variance in the dependent variable (and, consequently, 
the amount of variation to be explained by the independent variables) 
is increasing as b increases,® thus imparting a bias toward findings of 
low levels of capitalization when the level of R? is used as the 
maximum likelihood criterion. 


5 In private correspondence with the author, King agreed with the validity of these 
two mathematical corrections. However, he described their omission in the text of his 
article as a flaw of exposition, noting that his empirical analysis took these two factors 
into account. 

6In an analogous case of transformation of the dependent variable (involving a 
correction for heteroscedasticity), Pindyck and Rubinfeld (1976) observe: “Note that 
the R? measure associated with the weighted least-squares procedure 1s lower than the 
R? associated with the unweighted procedure. The decline in R? should not be taken as an 
indication that the heteroscedasticity correction was incorrect, since the weighted least-squares 
procedure involves the use of a transformed dependent variable” (p. 102, emphasis added). 
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To facilitate a valid comparison of the level of R? (or, equivalently, 
of the F-statistic) across equations, the estimating equation may be 
reexpressed as follows: 


Bo + LBiX; + XB;P; 


ies (1 + (bt/r)] 


(3) 
With this simple correction, a maximum likelihood estimate of the 
extent of capitalization may legitimately be obtained by varying (6/r) 
and successively reestimating, observing changes in the F-statistic.’ 


Estimating Property Tax Capitalization 


Because Wallace Oates also made his data available to me, it 1s possible 
to compare the results of several alternative specifications of the 
property tax capitalization equation, including the corrected King 
specification just proposed. To review, Oates writes his model of 
property tax capitalization as follows: 


Ve iE Zvi RaNeY oP): (4) 


where V = median home value by town, T = the effective percentage 
tax rate, E = annual educational expenditures per pupil, Z = per 
capita municipal spending on all functions other than local public 
schools and debt service, M = the linear distance in miles of the 
community from midtown Manhattan, R = median number of rooms 
per owner-occupied house, N = percent of houses built since 1950, Y 
= median family income, and P = percent of families in the community 
with an annual income of less than $3,000. To these nine variables, 
King adds a tenth: Tax, the median property tax bill by town, defined 
as the product of T and V. As Oates and King note, in the single- 
equation framework just described, the variables E, Z, and T or Tax 
will be correlated, thereby necessitating the use of two-stage least 
squares (ISLS) to obtain unbiased estimates of their respective 
coefficients. Accordingly, all of the equations were estimated using 
TSLS; the results are presented in table 1. 

Table 1 follows the format of King (1977) in presenting the results 
of the following specifications of the capitalization equation: (Ia) and 
(IIa) correspond to Oates’s (1969) and (1973) estimates, respectively, 
using the property tax rate as the independent variable measuring the 
extent of capitalization; (Ib) and (IIb) correspond to King’s first alter- 
native specification, using the property tax bill as an independent 


In practice, most standard statistical packages do not print the level of R2 precisely 
enough to enable the researcher to identify the maximum likelihood estimate. For this 
reason, the use of the F-statistic, which conveys the same information as R? but in a 
more precise form, is suggested. 
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variable; while (Ic) and (IIc) are the estimates from King’s second, 
erroneous specification. Finally, (Id) and (IId) are the results using 
my corrected version of King’s specification. 

Again following King, computed values for the coefficients of Tax 
are presented for my equations (Id) and (IId), to facilitate compari- 
sons across equations. These values correspond to capitalization of 
100 percent and 145 percent of the property tax bill, respectively.® 
Oates’s estimates put’ capitalization at 66 percent and 92 percent, 
respectively, while King’s estimates from (Ib) and (IIb) indicate 40 
percent and 70 percent capitalization and his (erroneous) estimates 
from (Ic) and (IIc) indicate 63 percent and 67 percent capitalization, 
respectively. Thus, while King finds that Oates’s misspecification of 
the property tax term results in an overestimation of the extent of 
property tax capitalization, my results indicate that this misspecifica- 
tion results in underestimating the extent of capitalization. 

One might reasonably object to the finding of capitalization in 
excess of 100 percent as inconsistent with capitalization theory. How- 
ever, the extent of capitalization depends directly on both the dis- 
count rate (r) and the time horizon assumed. As figure 1 illustrates, 
the corrected King technique is only able to identify the ratio of b to1 
which maximizes the value of the F -Statistic. Thus, the finding of 145 
percent capitalization at a 5 percent discount rate and 40-year time 
horizon is also consistent with, for example, 100 percent capitalization 
at a 2.6 percent discount rate and the same time horizon. And, viewed 
as a homeowner’s Opportunity cost, r would be defined as the interest 
rate on mortgages minus the rate of appreciation of the house. Given 
mortgage interest rates prevailing in 1960 (when Oates’s data were 
gathered), therefore, a finding of 100 percent capitalization at a 2.6 
percent discount rate may be quite reasonable and not at all inconsis- 
tent with the predictions of theory. As further evidence on this point, 
it is interesting to note that, for the equations in both groups (I) and 
(II), the highest values of R2 and of the t-statistics consistently obtain 
under the corrected King specifications, (Id) and (IId). 

Finally, it is quite correct to observe, as did King, that “[iJt can... be 
objected that this equation is inadequate to represent the very com- 
plex bundle known as ‘housing’” (p. 426). Perhaps, as King specu- 
lates, “[t]he instability in the estimated coefficients under different 


* The estimation technique provides a direct estimate of 6, the extent of capitaliza- 


tion; this estimate is then used to compute a coefficient of Tax comparable in inter- 


capitalization of 86 percent and 124 percent, respectively, at 5 percent with an infinite 
time horizon; this is equivalent to capitalization of 100.2 percent and 144.5 percent with 
a 40-year horizon. The present value of a unit stream for 40 years, discounted at 5 
percent, is 17.16. Therefore, 17.16 x 1.002 = 17.19 and 17.16 x 1.445 = 24.80. 
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Fic. 1.—Relationship between the F-statistic, discount rate, and the extent of capitali- 
zation in the corrected King technique (ray a'a’ is a projection of ray aa onto the b-r 
plane). 


specifications noted in the exchange between Pollakowski (1973) and 
Oates (1973) is very likely attributable to the fact that each included 
variable acts as a proxy for a great many omitted but correlated 
variables” (ibid.). To those who would be tempted to cast these results 
in a similar light, however, I should note that the alternative specifica- 
tions described herein were tested on a much larger and more de- 
tailed data set, with much the same results. 

Specifically, the various forms of the capitalization equation were 
applied to a sample of 1,453 single-family houses sold in San Mateo 
County, California between June 1, 1969 and June I, 1970;° because 
observations on suitable instrumental variables were unavailable, all 
equations were estimated using ordinary least squares. The sample, 
however, is noteworthy for at least two reasons: First, it consists of 
observations of actual sales of individual houses, in contrast to Oates’s 
relatively aggregated data. Second, each house is described by a de- 
tailed set of physical characteristics, public services received, and tax 
bills paid (see the Appendix for a full description of the variables 
included). 


® The San Mateo County data were originally gathered for a study of capitalization 
and the Tiebout hypothesis conducted by Jon C. Sonstelie of the University of Califor- 
nia, Santa Barbara, and Paul R. Portney of Resources for the Future (Sonstelie and 
Portney 1980). The author wishes to thank Sonstelie and Portney for kindly making 
their data available. The results reported here are drawn from a larger study conducted 
by the author for the Education Commission of the States (Reinhard 1978). 


1258 JOURNAL OF POLITICAL ECONOMY 


An initial attempt to estimate the extent of capitalization using 
King’s suggested form of the dependent variable (eq. [2], with the 
appropriate corrections for 6 and r) found that the value of the 
F-statistic fell as b was increased; that is, the results implied no capitali- 
zation of the property tax. Inspection of the difference in the residual 
sum of squared errors between the equations in which 6 took on 
values of 0.2 and 1.0 showed a large difference—an indication of 
changing variance in the dependent variable.'° 

Estimating the extent of capitalization with equation (3) yielded a 
maximum value of the F-statistic associated with 100 percent capitali- 
zation at a 3.6 percent discount rate and an infinite time horizon." 
Detailed results of the San Mateo County regression are presented in 
table Al. Finally, the extent of capitalization was also estimated using 
a conventional Oates-type equation, with the property tax rate in- 
cluded as an independent variable. The results (not shown here) 
indicated 31 percent capitalization of property taxes, using the same 
discount rate and time horizon—significantly lower than the estimate 
obtained with the corrected King specification. 


Conclusion 


The results reported here do nothing to detract from King’s observa- 
tion that “[mJany studies of property values have used a clearly incor- 
rect specification of the tax effect, one likely to introduce a bias of 
uncertain direction and magnitude” (p. 430). However, while King’s 
results indicate that the misspecification leads to an upward bias in the 
tax coefficient for his sample, I find that the use of an Oates-type 
equation results in a downward bias in this coefficient, both when 
applied to King’s data and to data from San Mateo County, Califor- 
nia. Further, on the basis of the usual statistical criteria relating to 
goodness of fit (R?) and the robustness of individual parameter esti- 
mates (f-statistics), my corrected version of King’s specification ap- 
pears to offer a significant improvement over the competing alterna- 
tive specifications compared herein. Whether this is a general result is, 
of course, a question for future research. This comment, however, 
clearly underscores the importance of the choice of estimating 
equations in influencing the particular results obtained. 


Manis impression was substantiated by the results of the Goldfeld-Quandt test for 
heteroscedasticity. Test statistic = 1.84, critical value (F with 1,431, 1,431 df =00, w 
df) = 1.0. 

at During the 1-year period in which the houses in the sample were sold, the rate on 
conventional home mortgages was 9 percent. A value of r equal to 3.6 percent implies 


that the houses were appreciating at a rate of 5.4 percent per year, not an unreasonable 
assumption for the houses in the sample. 
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Appendix 
Definitions of Variables,* San Mateo County Data 


ROOMS No. of rooms in the dwelling, excluding bathrooms 
BATH No. of bathrooms 


QUALI Dummy variable indicating average quality of construction 

QUAL2 Dummy variable indicating good quality of construction 

POOL Dummy variable indicating the presence of a swimming pool 

RMSIZE Average room size in square feet 

AGE Age of house 

LOT Size of lot in square feet 

CULSAC Dummy variable indicating location on a cul-de-sac or dead-end 
street 

ALLEY Dummy variable indicating alley behind house 

NUIS Dummy variable indicating presence of a nuisance near property 
(e.g., bus stop, parking lot, church overflow parking, etc.) 

PLAN Dummy variable indicating well-planned neighborhood 

FIRE Insurance Services Office of California rating of the quality of 


local fire protection, based on local department, adequacy of 
water supply, etc. (low rating indicates better protection) 
CRIME FBI Uniform Crime total per 100 residents 


* Variables adapted from Sonstelie and Portney (1980). 


TABLE Al 


REGRESSION RESULTS FOR SAN MATEO COUNTY 











Variable Coefficient t-statistic 
CONSTANT —49,608.65 9.0 
ROOMS 4,698.11 13.0 
BATH 3,088.44 4.0 
QUALI 977.16 ile 
QUAL2 34,598.91 2F3) 
POOL 7,978.64 5.2 
RMSIZE 137.41 14.6 
AGE —213.36 5.7 
LOT 1.142 14.6 
CULSAC 3,010.12 2.6 
ALLEY —2,099.20 lil 
NUIS —2,625.10 5 
PLAN 1,141.64 el 
FIRE —923.08 1.5 
CRIME —1,945.80 207 
REC —430.68 Srl 
ROAD 1,580.84 4.8 
DIST —2,794.35 4.1 
DIST2 108.59 4.5 
AIR —774.01 3.8 
READ 1,468.13 6.2 
EXWADA 29.66 6.6 





Norte.—The variables are defined as in the Appendix. Equation estimated using ordinary least squares; all 
variables except REC are of the predicted sign. Mean of dependent variable = 32,035; F = 1,997.53 (22, 1,431 df); 


R? = .97. 
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REC Recreational expenditures per capita 

ROAD Expenditures per capita on streets and highways 

DIST Linear distance to San Francisco 

DIST2 Linear distance to San Francisco, squared 

AIR Number of days per year with at least 0.10 ppm high hour 
oxidant concentration 

READ Improvement in reading level between first and third grade 


(measured in months) 
EXWADA Expenditures on elementary and high school education per 
pupil in average daily attendance (weighted sum) 
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Tax Changes and Cigarette Prices 


Michael T. Sumner and Robert Ward 
University of Salford 


The recurring question of the difference between specific and ad 
valorem taxes has recently been revived by Barzel (1976), who argued 
(p. 1185) that “. . . when quality is not fully controlled, the effects of 
the two types of taxes differ radically. The excise tax induces a 
substitution of quality for quantity, resulting in a greater increase in 
the price per unit than predicted by the constant-quality model. The 
ad valorem tax induces a reduction in quality and lower after-tax 
price than predicted.” Evidence consistent with these hypotheses is 
found in the response of cigarette prices to tax changes. In the 
further tests reported by Johnson (1978), prices rise significantly less 
than the tax in New Hampshire, where the state tax was ad valorem, 
and significantly more in the remaining states, which impose specific 
taxes. 

The purpose of this note is to question not Barzel’s analysis but 
rather the interpretation of the evidence, on the pedestrian grounds 
of implausibility. As Barzel himself observes (p. 1194, n. 32), “the size 
of New Hampshire makes it unlikely that cigarettes would be tailor- 
made for it. The scope for removing characteristics is quite limited, 
then.” The scope for adding characteristics in other states seems 
equally limited. To avoid misunderstanding it should be emphasized 
that the hypothesized variations in quality are a supply-side phenom- 
enon, not a demand-side response to the relative cheapening of 
superior brands when a specific tax is increased. Johnson’s estimates 
imply that the 9¢ increase in the New Hampshire tax over the period 
1954—76 should have reduced the net price by 3.6¢, ceteris paribus; 
the slightly larger mean tax increase in the remaining 48 continental 
states should have raised net price by 0.9¢. What changes in the 
quality of the services sold with cigarettes could have justified such a 
differential? What are the observable counterparts of the tax 
changes? In the absence of a convincing answer to those questions, the 
estimated response of cigarette prices to taxes represents a fact in 
search of a theory. 
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JOHNSON’S CIGARETTE REGRESSION RESULTS 














Variable Coefficient Standard Error T-Value 

Constant 13.651 439 31.078 

Tait, 1.101 013 87.028 
Nore.—R? = .965; number of observations = 1,162; 22 year dummies, 50 state dummies, and a New Hampshire 


tax dummy were also included in the regression equation. 


The explanation suggested here is that price changes are discon- 
tinuous because of the cost of change itself, the uncertainty of com- 
petitors’ reactions, and the indivisibility of the unit of account. When 
there is a major change in costs, the retail margins eroded by continu- 
ous changes in minor cost components will be restored, so that retail 
price rises by more than the initiating factor; the latter therefore 
appears to exert a disproportionate influence on prices, unless ap- 
propriate allowance is made for the backlog of small increases which 
were not worth implementing in isolation. If inflation is not antici- 
pated, the actual price will lag behind the equilibrium price except at 
the moment of adjustment; if it is anticipated, the actual and equilib- 
rium price levels will coincide on average in the period between 
adjustments.’ The tests reported below cannot discriminate between 
these alternatives but presume that one or other mode of behavior 
prevailed throughout the period. 

To illustrate the relevance of this prosaic suggestion to cigarette 
prices, consider Johnson’s equation: 


Pi; Ly 
Hi, 1, 





+ dummy variables, (1) 


where P is cigarette price in cents per pack, T is tax in cents per pack, 
and II is the consumer price index on a unit base: subscripts 2 and j, 
respectively, denote time and state effects for which the dummy 
variables are entered as controls. His estimates of the critical parame- 
ters are reproduced in table 1. The intercept, which registers the 
effect on nominal cigarette prices of general inflation, implies that 
during the first half of the sample period, when the consumer price 
index was rising very gently, a 1¢ price increase would have occurred 
at average intervals of more than 5 years in the absence of other 
disturbances. Tax changes took place more frequently, so it would not 
be surprising if price changes were concentrated in years when this 
major component of retail price increased. In the second half of the 


' We are indebted to Barzel for drawing our attention to this distinction. 
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TABLE 2 


CIGARETTE REGRESSION ReEsuLTsS: 45 STATES 











Variable Coefficient Standard Error T-Value 
Constant —.034 .050 .670 
All; 8.228 1.270 6.480 
AT .928 .036 25.792 
ty tana AL, 2.469 .669 3.691 
AW; 1.020 malalt 9.151 





Note.—R? = .744, number of observations = 784. 


period general inflation accelerated and tax changes became less 
frequent, so that the modeling of discontinuous price changes be- 
comes less critical. 

One simple method of distinguishing between the direct effect of 
tax changes and their indirect effect of facilitating the restoration of 
margins is to add an additional variable to a first-differenced nominal 
version of Johnson’s basic formulation. The estimating equation pro- 
posed is 


i 
AP;, = aAll, + BATi; + yt; >, ATI, ) 
k=m-+1 

where t;; = 1 if AT;; 4 0, = 0 otherwise. Subscript m indicates the date 
of the previous tax change in state 7. This formulation differs from a 
transformed version of Johnson’s equation (1) in not permitting the 
effect of general inflation to differ across states and time through 
the dummy variables; instead, effects of time specific to the cigarette 
industry were captured by adding the change in wholesale cigarette 
prices (AW) to equation (2), and the residuals were scrutinized for 
state effects. 

Thus augmented, equation (2) was estimated for 45 states which 
impose a specific tax? with the results shown in table 2. The intercept 
is appropriately insignificant and, as anticipated, wholesale price 
changes have a coefficient which does not differ significantly from 
unity. The inclusion of the latter variable reduces the estimated effect 
of general inflation but does not affect the main conclusions. These 
are that the diffused sources of price changes represented by general 
inflation enter both directly and in conjunction with tax changes; and 
that once allowance is made for the indirect influence of tax changes 
in effecting backlogged price increases, the coefficient on tax change 


2 We are grateful to Johnson for providing his data. First differencing and, more 
importantly, the construction of the backlog variable reduced the sample from 1,056 to 
784 observations and eliminated three states in which only a single tax change occurred. 
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TABLE 3 
CIGARETTE REGRESSION RESULTs: 11 STATES 

Variable Coefficient Standard Error T-Value 
Constant .046 .081 -569 
AI; 5.158 ee 2.291 
AT .698 .061 11.442 
ty &tom+1 All; 5.910 1.230 4.805 

i 1.236 .196 6.300 


v 





Nore.—R? = .727, number of observations = 231. 


itself becomes significantly less than unity. Finally, inspection of the 
residuals does not indicate any need to incorporate dummy variables 
for state effects: The largest mean state error is less than half the 
standard deviation of the residuals. 

It was suggested above that the association between tax changes and 
the restoration of margins eroded by mild inflation would be stronger 
the lower the general inflation rate. That suggestion was verified by 
fitting the estimating equation to the sample of 11 states for which 
data begin in 1956, the earliest possible date. The major differences in 
results are immediately apparent in table 3. The most obvious change 
is the lower point estimate of the tax-change coefficient. Second, the 
sum of the coefficients on the catch-up variable and contemporaneous 
price change remains about the same, but their relative size changes 
dramatically: Tax changes are much more important as an occasion 
for effecting backlogged price increases in this sample, in which the 
earlier, more quiescent inflation experience is more heavily weighted. 

The major question remaining is how well the model fits New 
Hampshire. For the period 1955-763 the mean forecast change ex- 
ceeds the actual increase by 13 percent; Theil’s U-statistic is 0.362. 
These figures are difficult to appraise in isolation but do not seem out 
of line with normal forecasting standards. That impression is 
strengthened by the observation that in 10 states the mean residual 
exceeds the average forecast error in New Hampshire. Finally, the 
largest individual error occurs in 1975, when the predicted price 
increase exceeded the actual rise by more than 1.5¢ to yield a propor- 
tionate error of 50 percent. This is particularly inconvenient for 


* To avoid losing early observations additional data were collected from the Tobacco 
Tax Council (1978). The tax series for New Hampshire differs from that provided by 
Johnson in two respects: There is an automatic change in 1957 which does not appear 
in Johnson’s data until 1960; and a single legislated change in 1971 is split between 
1971 and 1972 in Johnson’s data. The revised data are used here. The analysis of the 
forecasts reveals no difference between the impact of legislated and automatic tax 
changes. 
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Barzel and Johnson because the form, as well as the rate, of tax was 
changed in that year from ad valorem to specific. 

For present purposes, the most important result is the negative one 
that the tax-change coefficient is not significantly greater than unity; 
but, from a broader perspective, the conclusion that it is significantly 
less than unity is at least equally interesting. A possible explanation 
for the difference between this coefficient and that on wholesale price 
changes is interstate competition. In this context it is particularly 
noteworthy that the state which lies furthest below the regression 
plane is geographically isolated Maine, where the levels of taxes and 
prices have consistently exceeded those in neighboring New Hamp- 
shire. Of the two price increases not associated with intrastate tax 
changes and exceeding 1¢ in magnitude, one coincided with and the 
other followed a year after tax changes in New Hampshire. Massa- 
chusetts and Vermont also lie below the regression plane and have 
consistently experienced higher tax and price levels than New Hamp- 
shire; in both cases, three of four large price changes unconnected 
with domestic tax changes coincided with tax changes in New 
Hampshire. These selective observations are suggestive but hardly 
constitute evidence. It is, however, beyond the scope of this note to 
construct a scalar measure of state openness and to test its interaction 
with tax changes. 

There remain unexplored possibilities of modeling discontinuous 
price changes more faithfully, but for present purposes it 1s not 
necessary to pursue them. It has been shown that there is no dif- 
ference between the effects of specific and ad valorem taxes on ciga- 
rette prices and that the impact of taxes per se on prices is signifi- 
cantly less than one for one. 


References 


Barzel, Yoram. “An Alternative Approach to the Analysis of Taxation.” J.P.E. 
84, no. 6 (December 1976): 1177-97. 

Johnson, Terry R. “Additional Evidence on the Effect of Alternative Taxes 
on Cigarette Prices.” J.P.E. 86, no. 2, pt. 1 (April 1978): 325-28. 

Tobacco Tax Council, Inc. The Tax Burden on Tobacco: Historical Compilation. 
Vol. 13. Richmond, Va.: Tobacco Tax Council, 1978. 


Miscellany 





“Twas a Night in the Sixties 


"Twas a night in the sixties 
And all through the land 
Unemployment was falling 
Inflation in hand. 


The stock market was rising, 
Without any care, 

In hopes a Dow thousand 
Soon would be there. 


The Keynesians were snuggled 
Secure in their Chairs, 

While visions of multipliers 
Allayed all their cares. 


Paul with his textbook 
And Art with his gap 
Had settled their brains 
For a long postwar nap. 


When out in the land 
There arose such a clatter, 
A voice that was crying 
hat money could matter. 


This study extends earlier work by C. C. Moore, to whom my debt is obvious. It was 
presented at the 1980 Harvard Economics Department Student-Faculty Christmas 
party. Although I am always grateful to the National Science Foundation and National 
Bureau of Economic Research, neither organization provided any financial support 
for this project. 
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Away from their desks 
They flew in a flash 

To see who was claiming 
Such power for cash. 


They looked at their models 
With equations precise, 

That gave semblance of proof 
To conclusions so nice. 


When what to their wondering 
Eyes should appear 

But a miniature sleigh 

With eight tiny reindeer 


With a little old driver 
Who was having such fun 
They knew in a moment 
It must be Milton. 


More numerous than eagles 
His supporters they came 
And he whistled and shouted 
And called them by name. 


First John Say and then Hume, 
Then Marshall and Mill, 

Now Brunner and Meltzer 
And Anna and Phil. 


From the U. of Chicago 

To Minneapolis-St. Paul 
Then dash away! Dash away! 
Dash away all! 


As economic theories 

With which economists play 
When they meet with an obstacle 
Assume it away, 


So off to the journals, 
Their papers they flew, 
With monetarist theorems, 
Rational expectations too. 
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And even in Cambridge 
Was heard the new truth, 
‘The theorems and lemmas 
Of each little proof. 


The Keynesian thinkers 
Were spinning around 
When onto the scene, 
Milton came with a bound. 


He was dressed all in gold 
From his head to his foot 
And his ideas were polished 
And ready to put. 


“Velocity’s stable, 

M1 and M2, 

Which shows what the Fed 
Shouldn’t be trying to do.” 


“That curve by Phillips 
It really is straight 
And the cost of funds 
Is the real interest rate.” 
He wrote many a word, 
And with evidence too. 
At the NBER 

His volumes they grew. 


His ideas how simple. 
He puts them so well. 
It would be no wonder 
When he got his Nobel. 


A wink of his eye 

And a nod of his head 

Soon gave Keynesians to know 
They had something to dread. 


Then he sprang to his sleigh 
To his team gave a whistle 
And away they all flew 

Like the down of a thistle. 
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But I heard him exclaim 
As he drove out of sight, 
“Keep freedom for all, 
And keep money tight.” 


MarTIN S. FELDSTEIN 


National Bureau of Economic Research 
Harvard University 





Policy Research 


The title of this discourse is “Policy Research,” a contradiction in 
terms. Its purpose is to explain that the demise of this once proud 
office was not an accident of circumstance but an inexorable conse- 
quence of a tendency which has been at the heart of political systems 
since the Middle Ages. 

The elements which come together to explain this phenomenon 
were collected in various places. 

The first element appeared at the International Trade Commis- 
sion, where for a year I was privileged to observe both the delibera- 
tions of the commission to reach decisions and the diligent efforts of 
their staff to produce a staff report. But there was no relationship of 
one to the other. Indeed, the place where the commission’s decision 
and the staff report came together was always the printshop. I notice 
that though its purpose was obscure, the staff report did serve a 
function—to misdirect and to obfuscate. Having been exposed to the 
institutionalist tradition, I was confident that this function served 
some purpose, but I was unable to determine what that purpose was. 

Years passed, and I observed that in nearly every town and city of 
Europe, there was a “Jews quarter” or Ruelle des Juifs in the old 
commercial part of town. This treasure, too, I stored in the backyard 
of my mind. 

Finally, a book by Lester Thurow reminded us that zero-sum deci- 
sions are very difficult for a democratic political process to make. Then 
things began to fall into place. The purpose of misdirection and 
obfuscation became obvious. By making the decision process difficult 
to understand, they discourage those with a diffuse or marginal 
interest from participating in the decision; that is, they tend to disen- 
franchise the Opponents to a petition for protection and hence trans- 
form a zero-sum situation into a positive-sum decision—a decision from 
which all the effective participants will gain. 

This, then, is the essence of the decision process. Unless the great 
clamor of procedure confuses one side about what is going on, there 
can be no decision, that is, no policy. 


Presented September 4, 1980, at the U.S. Treasury Department, on the occasion of 


the departure of Finger and the dissolution of the Office of ‘Trade Research, of which 
Finger was the director. 
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Of course, if procedure confuses both sides, the government is in 
the ideal situation of not having to make a decision at all. This occurs 
with some frequency. 

But if one side sees through the obfuscation and the other does not 
(the result which the system seems designed to produce), then the 
aware side’s interests will be disproportionately weighted in the deci- 
sion process, and equilibrium will be “too far” in their direction. In 
summary, the policy process in an open political system 

(1) does not do a lot of things it should do, 
(2) appears to do a lot of things it does not do, and 
(3) overdoes everything it does do. 

Good research which accurately identifies the costs and benefits of a 
course of action is, in any of these avenues, more likely to be embar- 
rassing than welcome. It returns attention to the direct conflict which 
had to be avoided before a course of action—a policy—could be set. 
This means that a person who delivers a novel piece of analytical work 
to his Political Master will be made to feel like a woman who brings a 
new child to her husband—who had a vasectomy 5 years before. ‘The 
Master is less likely to share her joy in the birth of a new thing than to 
decry this evidence of her having conspired with mean spirits. Politi- 
cal responsibility is, you see, the ultimate intellectual vasectomy. 

The history of research in government agencies 1s like the history of 
Jews in pre—Industrial Revolution Europe. The Gentiles of a town 
would decide that they wanted to establish their town as a center of 
commerce. They would then invite the practitioners of this science 
into their town and set aside a place where they might live and 
practice. But, the Jews would practice their trade by rules so basic that 
they would not allow their gentlemen competitors, to whom com- 
merce was a diversion and a livelihood but not a life-style, to turn a 
profit. So, slowly the number of streets open to the Jews would be 
restricted and the number of products they were allowed to trade 
reduced. Or, they would be limited to the wholesale trade, in the hope 
that gentlemen could turn a profit retailing the merchandise the Jews 
gathered. 

But their numbers would dwindle, not so much because they had 
been sent away but because other Gentiles in other towns had decided 
that they should establish centers of commerce and had lured them 
away, to begin the process again. 

As we see across Europe the signs announcing the former presence 
of Jews, we see across organization charts of government agencies the 
street signs of research. But the original citizens of these quarters 
have usually gone, and their places have been taken by reform con- 
gregations, or even Gentiles. Thus, we, the researchers, like the Jews 
of the Middle Ages, are cursed to wander from agency to agency, 
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deprived of home and property by our inability to lose track of what 


we do.! 
J. MICHAEL FINGER 


World Bank 


' The mortal remains (former employees) of the dissolved Office of Trade Research 
were sent away with a memento inscribed “There are no Statesmen in this business. 
Trade theory is about whose hand is in whose pocket and trade policy is about who 
should take it out.” It was noted gratefully that as they departed, their temple would 
be dismantled and the signs in their quarter taken down. Thus they would be spared 
the indignity of seeing a reform congregation move in and eat pork and write briefing 
papers where once they followed the old law. The spirit of the office was also dis- 
patched. It was represented by a full frontal picture of a charging infantry brigade, 
inscribed, “We owe our success to a simple underlying philosophy.” 
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